Fundamentals of Wireless Communication 12 

David Tse, University of California, Berkeley 
Prarnod Viswanath, University of Illinois, Urbana-Champaign 

August 3, 2004 



1 Draft. Comments will be much appreciated; please send them to dtse@eecs.berkeley.edu 
or pramodv@uiuc.edu. Please do not distribute the notes without the authors’ consent. 

2 Section 1.2 and Chapter 2 are modified from R. G. Gallager’s notes for the MIT course 
6.450. 




Contents 



1 Introduction and Book Overview 7 

1.1 Book Objective 7 

1.2 Wireless Systems 8 

1.3 Book Outline 11 

2 The Wireless Channel 15 

2.1 Physical Modeling for Wireless Channels 15 

2.1.1 Free space, fixed transmitting and receive antennas 17 

2.1.2 Free space, moving antenna 18 

2.1.3 Reflecting wall, fixed antenna 19 

2.1.4 Reflecting wall, moving antenna 21 

2.1.5 Reflection from a Ground Plane 23 

2.1.6 Power Decay with Distance and Shadowing 24 

2.1.7 Moving Antenna, Multiple Reflectors 25 

2.2 Input/Output Model of the Wireless Channel 26 

2.2.1 The Wireless Channel as a Linear Time- Varying System .... 26 

2.2.2 Baseband Equivalent Model 28 

2.2.3 A Discrete Time Baseband Model 31 

2.2.4 Additive White Noise 35 

2.3 Time and Frequency Coherence 36 

2.3.1 Doppler Spread and Coherence Time 36 

2.3.2 Delay Spread and Coherence Bandwidth 38 

2.4 Statistical Channel Models 41 

2.4.1 Modeling Philosophy 41 

2.4.2 Rayleigh and Rician Fading 42 

2.4.3 Tap Gain Autocorrelation Function 44 

3 Point-to-Point Communication: 

Detection, Diversity and Channel Uncertainty 59 

3.1 Detection in a Rayleigh Fading Channel 60 

3.1.1 Noncoherent Detection 60 



1 




Tse and Viswanath: Fundamentals of Wireless Communication 



2 



3.1.2 Coherent Detection 63 

3.1.3 From BPSK to QPSK: Exploiting the Degrees of Freedom ... 67 

3.1.4 Diversity 71 

3.2 Time Diversity 71 

3.2.1 Repetition Coding 73 

3.2.2 Beyond Repetition Coding 75 

3.3 Antenna Diversity 84 

3.3.1 Receive Diversity 84 

3.3.2 Transmit Diversity: Space-Time Codes 86 

3.3.3 MIMO: A 2 x 2 Example 89 

3.4 Frequency Diversity 95 

3.4.1 Basic Concept 95 

3.4.2 Single-Carrier with ISI Equalization 97 

3.4.3 Direct Sequence Spread Spectrum 104 

3.4.4 Orthogonal Frequency Division Multiplexing 109 

3.5 Impact of Channel Uncertainty 117 

3.5.1 Noncoherent Detection for DS Spread Spectrum 117 

3.5.2 Channel Estimation 120 

3.5.3 Other Diversity Scenarios 122 

3.6 Bibliographical Notes 127 

4 Cellular Systems: Multiple Access and Interference Management 139 

4.1 Introduction 139 

4.2 Narrowband Cellular Systems 142 

4.2.1 Narrowband allocations: GSM system 143 

4.2.2 Impact on Network and System Design 146 

4.2.3 Impact on Frequency Reuse 147 

4.3 Wideband Systems: CDMA 148 

4.3.1 CDMA Uplink 151 

4.3.2 CDMA Downlink 166 

4.3.3 System Issues 167 

4.4 Wideband Systems: OFDM 170 

4.4.1 Allocation Design Principles 170 

4.4.2 Hopping Pattern 171 

4.4.3 Signal Characteristics and Receiver Design 173 

4.4.4 Sectorization 174 

4.5 Bibliographical Notes 178 

4.6 Exercises 179 




Tse and Viswanath: Fundamentals of Wireless Communication 



3 



5 Capacity of Wireless Channels 193 

5.1 AWGN Channel Capacity 194 

5.1.1 Repetition Coding 194 

5.1.2 Packing Spheres 195 

5.2 Resources of the AWGN Channel 199 

5.2.1 Continuous-Time AWGN Channel 200 

5.2.2 Power and Bandwidth 200 

5.3 Linear Time- Invariant Gaussian Channels 206 

5.3.1 Single Input Multiple Output (SIMO) Channel 206 

5.3.2 Multiple Input Single Output (MISO) Channel 208 

5.3.3 Frequency- Selective Channel 209 

5.4 Capacity of Fading Channels 216 

5.4.1 Slow Fading Channel 216 

5.4.2 Receive Diversity 219 

5.4.3 Transmit Diversity 220 

5.4.4 Time and Frequency Diversity 226 

5.4.5 Fast Fading Channel 230 

5.4.6 Transmitter Side Information 234 

5.4.7 Frequency-Selective Fading Channels 246 

5.4.8 Summary: A Shift in Point of View 246 

5.5 Bibliographical Notes 251 

6 Multiuser Capacity and Opportunistic Communication 265 

6.1 Uplink AWGN Channel 266 

6.1.1 Capacity via Successive Interference Cancellation 266 

6.1.2 Comparison with Conventional CDMA 270 

6.1.3 Comparison with Orthogonal Multiple Access 270 

6.1.4 General Jl-user LIplink Capacity 272 

6.2 Downlink AWGN Channel 274 

6.2.1 Symmetric Case: Two Capacity- Achieving Schemes 275 

6.2.2 General Case: Superposition Coding Achieves Capacity 278 

6.3 Uplink Fading Channel 284 

6.3.1 Slow Fading Channel 284 

6.3.2 Fast Fading Channel 286 

6.3.3 Full Channel Side Information 288 

6.4 Downlink Fading Channel 291 

6.4.1 Channel Side Information at Receiver Only 292 

6.4.2 Full Channel Side Information 293 

6.5 Frequency-Selective Fading Channels 293 

6.6 Multiuser Diversity 294 

6.6.1 Multiuser Diversity Gain 294 




Tse and Viswanath: Fundamentals of Wireless Communication 



4 



6.6.2 Multiuser versus Classical Diversity 297 

6.7 Multiuser Diversity: System Aspects 299 

6.7.1 Fair Scheduling and Multiuser Diversity 300 

6.7.2 Channel Prediction and Feedback 307 

6.7.3 Opportunistic Beamforming using Dumb Antennas 308 

6.7.4 Multiuser Diversity in Multi-cell Systems 317 

6.7.5 A System View 318 

6.8 Bibliographical Notes 325 

7 MIMO I: Spatial Multiplexing and Channel Modeling 341 

7.1 Multiplexing Capability of Deterministic MIMO Channels 342 

7.1.1 Capacity via Singular Value Decomposition 342 

7.1.2 Rank and Condition Number 345 

7.2 Physical Modeling of MIMO Channels 347 

7.2.1 Line-of-Sight SIMO channel 347 

7.2.2 Line-of-Sight MISO Channel 349 

7.2.3 Antenna arrays with only a line-of-sight path 350 

7.2.4 Geographically separated antennas 351 

7.2.5 Line-of-sight plus one reflected path 361 

7.3 Modeling of MIMO Fading Channels 365 

7.3.1 Basic Approach 365 

7.3.2 MIMO Multipath Channel 366 

7.3.3 Angular Domain Representation of Signals 367 

7.3.4 Angular Domain Representation of MIMO Channels 370 

7.3.5 Statistical Modeling in the Angular Domain 372 

7.3.6 Degrees of Freedom and Diversity 372 

7.3.7 Dependency on Antenna Spacing 378 

7.3.8 I.I.D. Rayleigh Fading Model 387 

8 MIMO II: Capacity and Multiplexing Architectures 393 

8.1 The V-BLAST Architecture 394 

8.2 Fast Fading MIMO Channel 396 

8.2.1 Capacity with CSI at Receiver 396 

8.2.2 Performance Gains 399 

8.2.3 Full CSI 408 

8.3 Receiver Architectures 411 

8.3.1 Linear Decorrclator 411 

8.3.2 Successive Cancellation 417 

8.3.3 Linear MMSE Receiver 419 

8.3.4 ^Information Theoretic Optimality 427 

8.4 Slow Fading MIMO Channel 430 




Tse and Viswanath: Fundamentals of Wireless Communication 



5 



8.5 D-BLAST: An Outage-Optimal Architecture 433 

8.5.1 Sub-optimality of V-BLAST 433 

8.5.2 Coding Across Transmit Antennas: D-BLAST 435 

8.5.3 Discussion 438 

8.6 Bibliographical Notes 442 

9 MIMO IV: Multiuser Channels 453 

9.1 Uplink with Multiple Receive Antennas 454 

9.1.1 Space-Division Multiple Access 454 

9.1.2 SDMA Capacity Region 456 

9.1.3 System Implications 459 

9.1.4 Slow Fading 461 

9.1.5 Fast Fading 463 

9.1.6 Multiuser Diversity Revisited 465 

9.2 MIMO Uplink ' 470 

9.2.1 SDMA with Multiple Transmit Antennas 470 

9.2.2 System Implications 474 

9.2.3 Fast Fading 475 

9.3 Downlink with Multiple Transmit Antennas 476 

9.3.1 Degrees of Freedom in the Downlink 477 

9.3.2 Uplink-Downlink Duality and Transmit Beamforming 478 

9.3.3 Precocling for Interference Known at Transmitter 483 

9.3.4 Precocling for the downlink 496 

9.3.5 Fast Fading 499 

9.4 MIMO Downlink 502 

9.5 Multiple Antennas in Cellular Networks: A System View 505 

9.5.1 Inter-cell Interference Management 507 

9.5.2 Uplink with Multiple Receive Antennas 508 

9.5.3 MIMO Uplink 510 

9.5.4 Downlink with Multiple Receive Antennas 511 

9.5.5 Downlink with Multiple Transmit Antennas 512 

9.6 Bibliographical Notes 516 

A Detection and Estimation in Additive Gaussian Noise 532 

A.l Gaussian Random Variables 532 

A. 1.1 Scalar Real Gaussian Random Variable 532 

A. 1.2 Real Gaussian Random Vectors 533 

A. 1.3 Complex Gaussian Random Vectors 536 

A. 2 Detection in Gaussian Noise 539 

A. 2.1 Scalar Detection 539 

A. 2. 2 Detection in a Vector Space 540 




Tse and Viswanath: Fundamentals of Wireless Communication 



6 



A. 2. 3 Detection in a Complex Vector Space 544 

A. 3 Estimation in Gaussian Noise 546 

A. 3.1 Scalar Estimation 546 

A. 3. 2 Estimation in a Vector Space 547 

A. 3. 3 Estimation in a Complex Vector Space 548 

B Information Theory Background 552 

B. l Discrete Memoryless Channels 552 

B.2 Entropy, Conditional Entropy and Mutual Information 555 

B.3 Noisy Channel Coding Theorem 558 

B. 3.1 Reliable Communication and Conditional Entropy 559 

B.3. 2 A Simple Upper Bound 559 

B.3. 3 Achieving the Upper Bound 560 

B.3. 4 Operational Interpretation 563 

B.4 Formal Derivation of AWGN Capacity 563 

B.4.1 Analog Memoryless Channels 564 

B.4. 2 Derivation of AWGN Capacity 565 

B.5 Sphere Packing Interpretation 566 

B.5.1 Upper Bound 566 

B.5. 2 Achievability 567 

B.6 Time- Invariant Parallel Channel 570 

B.7 Capacity of the Fast Fading Channel 571 

B.7.1 Scalar Fast Fading Channnel 571 

B.7.2 Fast Fading MIMO Channel 572 

B.8 Outage Formulation 573 

B.9 Multiple Access Channel 575 

B.9.1 Capacity Region 575 

B.9. 2 Corner Points of the Capacity Region 576 

B.9. 3 Fast Fading Uplink 577 




Chapter 1 

Introduction and Book Overview 



1.1 Book Objective 

Wireless communication is one of the most vibrant research areas in the communication 
held today. While it has been a topic of study since the 60’s, the past decade has 
seen a surge of research activities in the area. This is due to a confluence of several 
factors. First is the explosive increase in demand for tetherless connectivity, driven 
so far mainly by cellular telephony but is expected to be soon eclipsed by wireless 
data applications. Second, the dramatic progress in VLSI technology has enabled 
small-area and low-power implementation of sophisticated signal processing algorithms 
and coding techniques. Third, the success of second-generation (2G) digital wireless 
standards, in particular the IS-95 Code Division Multiple Access (CDMA) standard, 
provides a concrete demonstration that good ideas from communication theory can 
have a significant impact in practice. The research thrust in the past decade has led 
to a much richer set of perspectives and tools on how to communicate over wireless 
channels, and the picture is still very much evolving. 

There are two fundamental aspects of wireless communication that make the prob- 
lem challenging and interesting. These aspects are by and large not as significant in 
wireline communication. First is the phenomenon of fading: the time- variation of the 
channel strengths due to the small-scale effect of multipath fading, as well as larger 
scale effects such as path loss via distance attenuation and shadowing by obstacles. 
Second, unlike in the wired world where each transmitter-receiver pair can often be 
thought of as an isolated point-to-point link, wireless users communicate over the air 
and there is significant interference between them in wireless communication. The 
interference can be between transmitters communicating with a common receiver (e.g. 
uplink of a cellular system), between signals from a single transmitter to multiple re- 
ceivers (e.g. downlink of a cellular system), or between different transmitter-receiver 
pairs (e.g. interference between users in different cells). How to deal with fading and 
with interference is central to the design of wireless communication systems, and will 
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be the central themes of this book. Although this book takes a physical-layer per- 
spective, it will be seen that in fact the management of fading and interference has 
ramifications across multiple layers. 

The book has two objectives and can be roughly divided into two corresponding 
parts. The first part focuses on the basic and more traditional concepts of the field: 
modeling of multipath fading channels, diversity techniques to mitigate fading, coher- 
ent and noncoherent receivers, as well as multiple access and interference management 
issues in existing wireless systems. Current digital wireless standards will be used as 
examples. The second part deals with the more recent developments of the field. Two 
particular topics are discussed in depth: opportunistic communication and space-time 
multiple antenna communication. It will be seen that these recent developments lead 
to very different points of view on how to deal with fading and interference in wireless 
systems. A particular theme is the multifaceted nature of channel fading. While fading 
has traditionally been viewed as a nuisance to be counteracted, recent results suggest 
that fading can in fact be viewed as beneficial and exploited to increase the system 
spectral efficiency. 

The expected background is solid undergraduate courses in signal and systems, 
probability and digital communication. It is expected that the readers of this book 
may have a wide range of backgrounds, and some of the appendices will be catered to 
providing supplementary background material. We will also try to introduce concepts 
from first principles as much as possible. Information theory has played a significant 
role in many of the recent developments in wireless communication, and we will use it 
as a coherent framework throughout the book. The level of sophistication at which we 
use information theory is however not high; we will cover all the required background 
in this book. 



1.2 Wireless Systems 

Wireless communication, despite the hype of the popular press, is a field that has 
been around for over a hundred years, starting around 1897 with Marconi’s successful 
demonstrations of wireless telegraphy. By 1901, radio reception across the Atlantic 
Ocean had been established; thus rapid progress in technology has also been around 
for quite a while. In the intervening hundred years, many types of wireless systems 
have flourished, and often later disappeared. For example, television transmission, 
in its early days, was broadcast by wireless radio transmitters, which is increasingly 
being replaced by cable transmission. Similarly, the point to point microwave circuits 
that formed the backbone of the telephone network are being replaced by optical fiber. 
In the first example, wireless technology became outdated when a wired distribution 
network was installed; in the second, a new wired technology (optical fiber) replaced 
the older technology. The opposite type of example is occurring today in telephony, 
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where wireless (cellular) technology is partially replacing the use of the wired telephone 
network (particularly in parts of the world where the wired network is not well devel- 
oped). The point of these examples is that there are many situations in which there 
is a choice between wireless and wire technologies, and the choice often changes when 
new technologies become available. 

In this book, we will concentrate on cellular networks, both because they are of 
great current interest and also because the features of many other wireless systems can 
be easily understood as special cases or simple generalizations of the features of cellular 
networks. A cellular network consists of a large number of wireless subscribers who 
have cellular telephones (mobile users), that can be used in cars, in buildings, on the 
street, or almost anywhere. There are also a number of fixed base stations, arranged 
to provide coverage (via wireless electromagnetic transmission) of the subscribers. 

The area covered by a base station, i.e., the area from which incoming calls reach 
that base station, is called a cell. One often pictures a cell as a hexagonal region with 
the base station in the middle. One then pictures a city or region as being broken 
up into a hexagonal lattice of cells (see Figure 1.2a). In reality, the base stations are 
placed somewhat irregularly, depending on the location of places such as building tops 
or hill tops that have good communication coverage and that can be leased or bought 
(see Figure 1.2b). Similarly, the mobile users connected to a base station are chosen 
by good communication paths rather than geographic distance. 




Part (a): an oversimplified view Part (b): a more realistic case where base 
in which each cell is hexagonal. stations are irregularly placed and cell phones 

choose the best base station 



Figure 1.1: Cells and Base stations for a cellular network 

When a mobile user makes a call, it is connected to the base station to which it 
appears to have the best path (often the closest base station). The base stations in a 
given area are then connected to a mobile telephone switching office (MTSO, also called 
a mobile switching center MSC) by high speed wire connections or microwave links. 
The MTSO is connected to the public wired telephone network. Thus an incoming call 
from a mobile user is first connected to a base station and from there to the MTSO and 
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then to the wired network. From there the call goes to its destination, which might 
be an ordinary wire line telephone, or might be another mobile subscriber. Thus, we 
see that a cellular network is not an independent network, but rather an appendage 
to the wired network. The MTSO also plays a major role in coordinating which base 
station will handle a call to or from a user and when to handoff a user from one base 
station to another. 

When another telephone (either wired or wireless) places a call to a given user, the 
reverse process takes place. First the MTSO for the called subscriber is found, then the 
closest base station is found, and finally the call is set up through the MTSO and the 
base station. The wireless link from a base station to a mobile user is interchangeably 
called the downlink or the forward channel , and the link from a user to a base station 
is called the uplink or a reverse channel. There are usually many users connected to a 
single base station, and thus, for the forward channels, the base station must multiplex 
together the signals to the various connected users and then broadcast one waveform 
from which each user can extract its own signal. The combined channel from the 
one base station to the multiple users is called a broadcast channel. For the reverse 
channels, each user connected to a given base station transmits its own waveform, and 
the base station receives the sum of the waveforms from the various users plus noise. 
The base station must then separate out the signals from each user and forward these 
signals to the MTSO. The combined channel from each user to the base station is 
called a multiaccess channel. 

Older cellular systems, such as the AMPS system developed in the U.S. in the 
80’s, are analog. That is, a voice waveform is modulated on a carrier and transmitted 
without being transformed into a digital stream. Different users in the same cell 
are assigned different modulation frequencies, and adjacent cells use different sets of 
frequencies. Cells sufficiently far away from each other can reuse the same set of 
frequencies with little danger of interference. 

All of the newer cellular systems are digital (i.e., they have a binary interface). Since 
these cellular systems, and their standards, were originally developed for telephony, the 
current data rates and delays in cellular systems are essentially determined by voice 
requirements. At present, these systems are mostly used for telephony, but both the 
capability to send data and the applications for data are rapidly increasing. Later on 
we will discuss wireless data applications at higher rates than those compatible with 
voice channels. 

As mentioned above, there are many kinds of wireless systems other than cellular. 
First there are the broadcast systems such as AM radio, FM radio, TV, and paging 
systems. All of these are similar to the broadcast part of cellular networks, although the 
data rates, the size of the areas covered by each broadcasting node, and the frequency 
ranges are very different. Next, there are wireless LANs (local area networks) These are 
designed for much higher data rates than cellular systems, but otherwise are similar to 
a single cell of a cellular system. These are designed to connect PC’s, shared peripheral 
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devices, large computers, etc. within an office building or similar local environment. 
There is little mobility expected in such systems and their major function is to avoid 
the mazes of cable that are strung around office buildings. There is a similar (even 
smaller scale) standard called Bluetooth whose purpose is to reduce cabling in an office 
and simplify transfers between office and hand held devices. Finally, there is another 
type of LAN called an ad hoc network. Here, instead of a central node (base station) 
through which all traffic flows, the nodes are all alike. The network organizes itself 
into links between various pairs of nodes and develops routing tables using these links. 
Here the network layer issues of routing, dissemination of control information, etc. are 
of primary concern rather than the physical layer issues of major interest here. 

One of the most important questions for all of these wireless systems is that of 
standardization. For cellular systems in particular, there is a need for standardization 
as people want to use their cell phones in more than just a single city. There are 
already three mutually incompatible major types of digital cellular systems. One is 
the GSM system which was standardized in Europe but now used worldwide, another 
is the TDMA (time-division multiple access) standard developed in the U.S. (IS-136), 
and a third is CDMA (code division multiple access) (IS-95). We discuss and contrast 
these briefly later. There are standards for other systems as well, such as the IEEE 
802.11 standards for wireless LANs. 

In thinking about wireless LANs and wide-area cellular telephony, an obvious ques- 
tion is whether they will some day be combined into one network. The use of data rates 
compatible with voice rates already exists in the cellular network, and the possibility 
of much higher data rates already exists in wireless LANs, so the question is whether 
very high data rates are commercially desirable for the standardized wide-area cellular 
network. The wireless medium is a much more difficult medium for communication 
than the wired network. The spectrum available for cellular systems is limited, the 
interference level is significant, and rapid growth is increasing the level of interference. 
Adding higher data rates will exacerbate this interference problem. In addition, the 
screen on hand held devices is small, limiting the amount of data that can be presented 
and suggesting that many existing applications of such devices do not need very high 
data rates. Thus whether very high speed data for cellular networks is necessary or 
desirable in the near future may depend very much on new applications. On the other 
hand, cellular providers are anxious to provide increasing data rates so as to be viewed 
as providing more complete service than their competitors. 

1.3 Book Outline 

The central object of interest is the wireless fading channel. Chapter 2 introduces the 
multipath fading channel model that we use for the rest of the book. Starting from a 
continuous-time passband channel, we derive a discrete-time complex baseband model 
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more suitable for analysis and design. We explain the key physical parameters such 
as coherence time, coherence bandwidth, Doppler spread and delay spread and survey 
several statistical models for multipath fading (due to constructive and destructive 
interference of multipaths). There have been many statistical models proposed in the 
literature; we will be far from exhaustive here. The goal is to have a small set of 
example models in our repertoire to illustrate the basic communication phenomena we 
will study. 

Chapter 3 introduces many of the issues of communicating over fading channels in 
the simplest point-to-point context. We start by looking at the problem of detection of 
uncoded transmission over a narrowband fading channel. We consider both coherent 
and noncoherent reception, i.e. with and without channel knowledge at the receiver 
respectively. We find that in both cases the performance is very poor, much worse 
than an AWGN channel with the same signal- to-noise ratio (SNR). This is clue to a 
significant probability that the channel is in deep fade. We study various diversity 
techniques to mitigate this adverse effect of fading. Diversity techniques increase reli- 
ability by sending the same information through multiple independently faded paths 
so that the probability of successful transmission is higher. Some of these techniques 
we will study include: 

• interleaving of coded symbols over time; 

• multipath combining or frequency hopping in spread-spectrum systems to obtain 
frequency diversity 

• use of multiple transmit or receive antennas, via space-time coding. 

• macro diversity via combining of signals received from or transmitted to multiple 
base stations (soft handoff) 

In some scenarios, there is an interesting interplay between channel uncertainty and 
the diversity gain: as the number of diversity branches increases, the performance of 
the system first improves due to the diversity gain but then subsequently deteriorates 
as channel uncertainty makes it more difficult to combine signals from the different 
branches. 

In Chapter 4 we shift our focus from point-to-point communication to studying 
cellular systems as a whole. Multiple access and inter-cell interference management 
are the key issues that come to the forefront. We explain how existing digital wireless 
systems deal with these issues. We discuss the concepts of frequency reuse and cell sec- 
torization, and contrast between narrowband systems such as GSM and IS-136, where 
users within the same cell are kept orthogonal and frequency is reused only in cells far 
away, and CDMA systems, where the signals of users both within the same cell and 
across different cells are spread across the same spectrum, i.e. frequency reuse factor of 
1. We focus particularly on the design principles of spread-spectrum CDMA systems. 
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In addition to the diversity techniques of time-interleaving, multipath combining and 
soft handoff, power control and interference averaging are the key mechanisms to man- 
age intra-cell and inter-cell interference respectively. All five techniques strive toward 
the same system goal: to maintain the channel quality of each user, as measured by 
the signal-to- interference- and-noise ratio (SINR), as constant as possible. We conclude 
this chapter with the discussion of a wideband orthogonal frequency division multi- 
plexing system (OFDM) which combines the advantages of CDMA and narrowband 
systems. 

In Chapter 5 we study the basic information theory of wireless channels. This gives 
us a higher level view of the tradeoffs involved in the earlier chapters as well as lays 
the foundation for understanding the more modern developments in the subsequent 
chapters. We use as a baseline for comparison the performance over the (non-faded) 
additive white Gaussian noise (AWGN) channel. We introduce the information theo- 
retic concept of channel capacity as the basic performance measure. The capacity of a 
channel provides the fundamental limit of communication achievable by any scheme. 
For the fading channel, there are several capacity measures, relevant for different sce- 
narios. Using these capacity measures, we define several resources associated with a 
fading channel: 1) diversity; 2) number of degrees of freedom; 3) received power. These 
three resources form a basis for assessing the nature of performance gain by the various 
communication schemes studied in the rest of the book. 

Chapters 6 to 9 cover the more recent developments in the field. In Chapter 6 we 
revisit the problem of multiple access over fading channels from a more fundamental 
point of view. Information theory suggests that if both the transmitters and the 
receiver can track the fading channel, the optimal strategy to maximize the total 
system throughput is to allow only the user with the best channel to transmit at any 
time. A similar strategy is also optimal for the downlink (one-to-many) . Opportunistic 
strategies of this type yield a system wide multiuser diversity gain: the more users in 
the system, the larger the gain, as there is more likely to have a user with a very strong 
channel. To implement the concept in a real system, three important considerations 
are: 1) fairness of the resource allocation across users, 2) delay experienced by the 
individual user waiting for its channel to become good, and 3) measurement inaccuracy 
and delay in feeding back the channel state to the transmitters. We discuss how these 
issues are addressed in the context of IS-865 (also called HDR or CDMA 2000 lx 
EV-DO), a third-generation wireless data system. 

A wireless system consists of multiple dimensions: time, frequency, space and users. 
Opportunistic communication maximizes the spectral efficiency by measuring when and 
where the channel is good and only transmits in those degrees of freedom. In this con- 
text, channel fading is beneficial in the sense that the fluctuation of the channel across 
the degrees of freedom ensures that there will be some degrees of freedom in which the 
channel is very good. This is in sharp contrast to the diversity-based approach we will 
discuss in Chapter 3, where channel fluctuation is always detrimental and the design 
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goal is to average out the fading to make the overall channel as constant as possible. 
Taking this philosophy one step further, we discuss a technique, called opportunistic 
beamforming, in which channel fluctuation can be induced in situations when the nat- 
ural fading has small dynamic range and/or is slow. From the cellular system point of 
view, this technique also increases the fluctuations of the interference imparted on ad- 
jacent cells, and presents an opposing philosophy to the notion of interference averaging 
in CDMA systems. 

Chapters 7, ?? and 9 discuss multi-input multi-output (MIMO) systems. It has 
been known for a while that a multiaccess system with multiple receive antennas allow 
several users to simultaneously communicate to the receiver. The multiple antennas 
in effect increase the number of degrees of freedom in the system and allow spatial 
separation of the signals from the different users. It has recently been shown that 
a similar effect occurs for point-to-point channel with multiple transmit and receive 
antennas, i.e. even when the antennas of the multiple users are co-located. This holds 
provided that the scattering environment is rich enough to allow the receive antennas 
separate out the signal from the different transmit antennas. This allows the spatial 
multiplexing of information. We see yet another example where channel fading is in 
fact beneficial to communication. 

Chapter 7 starts with a discussion of MIMO channel models. Capacity results 
in the point-to-point case are presented. We then describe several signal processing 
and coding schemes which achieve or approach the channel capacity. These schemes 
are based on techniques including singular-value decomposition, linear and decision- 
feedback equalization (also known as successive cancellation). As shown in Chapter 3, 
multiple antennas can also be used to obtain diversity gain, and so a natural question 
arises as how diversity and spatial multiplexing can be put in the same picture. In 
Chapter ??, the problem is formulated as a tradeoff between the diversity and multi- 
plexing gain achievable, and it is shown that for a given fading channel model, there 
is an optimal tradeoff between the two types of gains achievable by any space-time 
coding scheme. This is then used as a unified framework to assess both the diversity 
and multiplexing performance of several schemes. Finally, in Chapter 9, we extend 
our discussion to multiuser and multi-cellular systems. Here, in addition to provid- 
ing spatial multiplexing and diversity, multiple antennas can also be used to suppress 
interference. 




Chapter 2 

The Wireless Channel 



A good understanding of the wireless channel, its key physical parameters and the 
modeling issues, lays the foundation for the rest of the book. This is the goal of this 
chapter. 

A defining characteristic of the mobile wireless channel is the variations of the 
channel strength over time and over frequency. The variations can be roughly divided 
into two types: 

• large-scale fading , due to path loss of signal as a function of distance and shad- 
owing by large objects such as buildings and hills. This occurs as the mobile 
moves through a distance of the order of the cell size, and is typically frequency 
independent. 

• small-scale fading, clue to the constructive and destructive interference of the 
multiple signal paths between the transmitter and receiver. This occurs at the 
spatial scale of the order of the carrier wavelength, and is frequency dependent. 

We will talk about both types of fading in this chapter, but with more emphasis 
on the latter. Large-scale fading is more relevant to issues such as cell-site planning. 
Small-scale multipath fading is more relevant to the design of reliable and efficient 
communication systems - the focus of this book. 

We start with the physical modeling of the wireless channel in terms of electro- 
magnetic waves. We then derive an input-output linear time varying model for the 
channel, and define some important physical parameters. Finally we introduce a few 
statistical models of the channel variation over time and over frequency. 

2.1 Physical Modeling for Wireless Channels 

Wireless channels operate through electromagnetic radiation from the transmitter to 
the receiver. In principle, one could solve the electromagnetic field equations, in con- 
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Figure 2.1: Channel quality varies over multiple time scales. At a slow scale, channel 
varies due to large-scale fading effects. At a fast scale, channel varies due to multipath 
effects. 

junction with the transmitted signal, to find the electromagnetic field impinging on the 
receiver antenna. This would have to be done taking into account the obstructions 1 
caused by ground, buildings, vehicles, etc. in the vicinity of this electromagnetic wave. 

Cellular communication in the USA is limited by the Federal Communication Com- 
mission (FCC), and by similar authorities in other countries, to one of three frequency 
bands, one around 0.9 GHz, one around 1.9 GHz, and one around 5.8 GHz. The wave- 
length A (/) of electromagnetic radiation at any given frequency / is given by A = c/f, 
where c = 3 x 10 s m/s is the speed of light. The wavelength in these cellular bands 
is thus a fraction of a meter, so to calculate the electromagnetic field at a receiver, 
the locations of the receiver and the obstructions would have to be known within sub- 
meter accuracies. The electromagnetic held equations are therefore too complex to 
solve, especially on the fly for mobile users. Thus, we have to ask what we really need 
to know about these channels, and what approximations might be reasonable. 

One of the important questions is where to choose to place the base stations, and 
what range of power levels are then necessary on the downlink and uplink channels. 
To some extent this question must be answered experimentally, but it certainly helps 
to have a sense of what types of phenomena to expect. Another major question is 
what types of modulation and detection techniques look promising. Here again, we 
need a sense of what types of phenomena to expect. To address this, we will construct 
stochastic models of the channel, assuming that different channel behaviors appear 
with different probabilities, and change over time (with specific stochastic properties). 

1 By obstructions, we mean not only objects in the line-of-sight between transmitter and receiver, 
but also objects in locations that cause non-negligible changes in the electromagnetic field at the 
receiver; we shall see examples of such obstructions later. 
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We will return to the question of why such stochastic models are appropriate, but for 
now we simply want to explore the gross characteristics of these channels. Let us start 
by looking at several over-idealized examples. 



2.1.1 Free space, fixed transmitting and receive antennas 



First consider a fixed antenna radiating into free space. In the far field, 2 the electric 
field and magnetic field at any given location are perpendicular both to each other and 
to the direction of propagation from the antenna. They are also proportional to each 
other, so it is sufficient to know only one of them (just as in wired communication, 
where we view a signal as simply a voltage waveform or a current waveform). In 
response to a transmitted sinusoid cos 2ir ft, we can express the electric far held at 
time t as 



(r,0,VO) 



cos 2tt f (t - |) 
r 



( 2 . 1 ) 



Here, (r, 6, ij>) represents the point u in space at which the electric held is being mea- 
sured, where r is the distance from the transmitting antenna to u and where ( 9 , -0) 
represents the vertical and horizontal angles from the antenna to u, respectively. The 
constant c is the speed of light, and a s (0,^, /) is the radiation pattern of the send- 
ing antenna at frequency / in the direction ( 9 ,?/?); it also contains a scaling factor to 
account for antenna losses. Note that the phase of the held varies with f r/c, corre- 
sponding to the delay caused by the radiation travelling at the speed of light. 

We are not concerned here with actually hireling the radiation pattern for any given 
antenna, but only with recognizing that antennas have radiation patterns, and that 
the free space far held behaves as above. 

It is important to observe that, as the distance r increases, the electric held de- 
creases as r -1 and thus the power per square meter in the free space wave decreases 
as r -2 . This is expected, since if we look at concentric spheres of increasing radius 
r around the antenna, the total power radiated through the sphere remains constant, 
but the surface area increases as r 2 . Thus, the power per unit area must decrease as 
r -2 . We will see shortly that this r~ 2 reduction of power with distance is often not 
valid when there are obstructions to free space propagation. 

Next, suppose there is a hxed receive antenna at the location u = (r, 6, ip). The re- 
ceived waveform (in the absence of noise) in response to the above transmitted sinusoid 
is then 



E r (f, t, u) 



cos2n f(t - |) 
r 



( 2 . 2 ) 



where a(6 : ip,f) is the product of the antenna patterns of transmitting and receive 
antennas in the given direction. Our approach to (2.2) is a bit odd since we started 



2 The far field is the field sufficiently far away from the antenna so that (2.1) is valid. For cellular 
systems, it is a safe assumption that the receiver is in the far field. 
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with the free space held at u in the absence of an antenna. Placing a receive antenna 
there changes the electric held in the vicinity of u, but this is taken into account by 
the antenna pattern of the receive antenna. 

Now suppose, for the given u, that we define 

a(0,ip,f)e~ j2nfr / c 
r 

We then have E r (f,t,u ) = 3? [H(f)e j27T:ft ] . We have not mentioned it yet, but (2.1) 
and (2.2) are both linear in the input. That is, the received held (waveform) at u in 
response to a weighted sum of transmitted waveforms is simply the weighted sum of 
responses to those individual waveforms. Thus, H(f) is the system function for an 
LTI (linear time-invariant) channel, and its inverse Fourier transform is the impulse 
response. The need for understanding electromagnetism is to determine what this 
system function is. We will hnd in what follows that linearity is a good assumption for 
all the wireless channels we consider, but that the time invariance does not hold when 
either the antennas or obstructions are in relative motion. 




(2.3) 



2.1.2 Free space, moving antenna 

Next consider the fixed antenna and free space model above with a receive antenna that 
is moving with speed v in the direction of increasing distance from the transmitting 
antenna. That is, we assume that the receive antenna is at a moving location described 
as u(t) = (r(t),0,if)) with r(t) = r 0 + vt. Using (2.1) to describe the free space electric 
field at the moving point u(t) (for the moment with no receive antenna), we have 



(r 0 + vt,e,i/>)) 



a s (9^J) cos2tt fjt - ^-f) 
r 0 + vt 



(2.4) 



Note that we can rewrite f(t—r 0 /c—vt/c ) as f(l—v/c)t — fr 0 /c. Thus, the sinusoid at 
frequency / has been converted to a sinusoid of frequency /(l— u/c); there has been 
a Doppler shift of —fv/c due to the motion of the observation point. 3 Intuitively, 
each successive crest in the transmitted sinusoid has to travel a little further before it 
gets observed at the moving observation point. If the antenna is now placed at u(t), 
and the change of field due to the antenna presence is again represented by the receive 
antenna pattern, the received waveform, in analogy to (2.2), is 



{r 0 +vt,6,ij))) 



a(9 } -0, /) cos 2? r/ [(l-^)t - 
r 0 + vt 



(2.5) 



3 The reader should be familiar with the Doppler shift associated with moving cars. When an 
ambulance is rapidly moving toward us we hear a higher frequency siren. When it passes us we hear 
a rapid shift toward lower frequencies. 
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This channel cannot be represented as an LTI channel. If we ignore the time varying 
attenuation in the denominator of (2.5), however, we can represent the channel in 
terms of a system function followed by translating the frequency / by the Doppler 
shift —fv/c. It is important to observe that the amount of shift depends on the 
frequency /. We will come back to discussing the importance of this Doppler shift and 
of the time varying attenuation after considering the next example. 

The above analysis does not depend on whether it is the transmitter or the receiver 
(or both) that are moving. So long as r(t) is interpreted as the distance between the 
antennas (and the relative orientations of the antennas are constant), (2.4) and (2.5) 
are valid. 

2.1.3 Reflecting wall, fixed antenna 

Consider Figure 2.2 below in which there is a fixed antenna transmitting the sinusoid 
cos 27r ft, a fixed receive antenna, and a single perfectly reflecting large fixed wall. We 
assume that in the absence of the receive antenna, the electromagnetic field at the 
point where the receive antenna will be placed is the sum of the free space field coming 
from the transmit antenna plus a reflected wave coming from the wall. As before, in 
the presence of the receive antenna, the perturbation of the field due to the antenna 
is represented by the antenna pattern. An additional assumption here is that the 
presence of the receive antenna does not appreciably affect the plane wave impinging 
on the wall. In essence, what we have done here is to approximate the solution of 

Transmit 

Antenna I 



d 




receive antenna 



Figure 2.2: Illustration of a direct path and a reflected path 



Maxwell’s equations by a method called ray tracing. The assumption here is that 
the received waveform can be approximated by the sum of the free space wave from 
the sending transmitter plus the reflected free space waves from each of the reflecting 
obstacles. 

In the present situation, if we assume that the wall is very large, the reflected wave 
at a given point is the same (except for a sign change) as the free space wave that would 
exist on the opposite side of the wall if the wall were not present (see Figure 2.3). This 
means that the reflected wave from the wall has the intensity of a free space wave at 
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a distance equal to the distance to the wall and then back to the receive antenna, i.e., 
2d — r. Using (2.2) for both the direct and the reflected wave, and assuming the same 
antenna gain a for both waves, we get 

. acos2nf (t — -) a cos2nf (t — 

E r (f,t = ^ ^ s-l. 2.6 

r 2d — r 



Sending 

Antenna 




Figure 2.3: Relation of reflected wave to wave without wall. 



The received signal is a superposition of two waves, both of frequency /. The phase 
difference between the two waves is: 



A 6 



2r:f{2d — r) 






r) + 7T. 



(2.7) 



When the phase difference is an integer multiple of 2n, the two waves add constructively , 
and the received signal is strong. When the phase difference is an odd integer multiple 
of 7r, the two waves add destructively , and the received signal is weak. As a function 
of r, this translates into a spatial pattern of constructive and destructive interference 
of the waves. The distance from a peak to a valley is called the coherence distance : 



A 




At *— — 
. 4 


(2.8) 



where A := cj f is the wavelength of the transmitted sinusoid. 

The constructive and destructive interference pattern also depends on the frequency 
/: for a fixed r, if / changes by 



1 

2 



2d 




(2.9) 



we move from a peak to a valley. The quantity 
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is called the delay spread of the channel: it is the difference between the propagation 
delays along the two signal paths. Thus, the constructive and destructive interference 
pattern changes significantly if the frequency changes by an amount of the order of 
1/Tf}. This parameter is called the coherence bandwidth. 

2.1.4 Reflecting wall, moving antenna 

Suppose the receive antenna is now moving at a velocity v (Figure 2.4). As it moves 
through the pattern of constructive and destructive interference created by the two 
waves, the strength of the received signal increases and decreases. This is the phe- 
nomenon of multipath fading. The time taken to travel from a peak to a valley is 
c/(4fv): this is the time-scale at which the fading occurs, and it is called the coher- 
ence time of the channel. 

Sending 




v 



Figure 2.4: Illustration of a direct path and a reflected path 



An equivalent way of seeing this is in terms of the Doppler shifts of the direct and 
the reflected waves. Suppose the receive antenna is at location r 0 at time 0. Taking 
r = r 0 + vt in (2.6), we get: 



E r (f,t) = 



a cos 2vr/[(l - J)t - r f] « cos 27r f [( x + ~ c )t + 



r 0 + vt 



( 2 . 11 ) 



The first term, the direct wave, is a sinusoid of slowly decreasing magnitude at 
frequency /(I — v/c), experiencing a Doppler shift D\ := —fv/c. The second is a 
sinusoid of smaller but increasing magnitude at frequency /(I + v/c), with a Doppler 
shift D 2 := +fv/c . The parameter 



D s D 2 — D\ 



( 2 . 12 ) 
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Figure 2.5: The received waveform oscillating at frequency / with a slowly varying 
envelope at frequency D s /2. 



is called the Doppler spread. For example, if the mobile is moving at 60 km/h and 
/ = 900 MHz, the Doppler spread is 100 Hz. The role of the Doppler spread can be 
visualized most easily when the mobile is much closer to the wall than to the transmit 
antenna. In this case the attenuations are roughly the same for both paths, and we 
can approximate the denominator of the second term by r = r 0 + vt. Then, combining 
the two sinusoids, we get 



E r (f,t) 



2 a sin 2nf 



-J + 



( r o~ d ) 



sin27r f[t 



r n + vt 



(2.13) 



This is the product of two sinusoids, one at the input frequency /, which is typically 
on the order of GHz, and the other one at fv/c = D s / 2, which might be on the 
order of 50Hz. Thus, the response to a sinusoid at / is another sinusoid at / with a 
time- varying envelope, with peaks going to zeros around every 5 ms (Figure 2.5). The 
envelope is at its widest when the mobile is at a peak of the interference pattern and 
at its narrowest when the mobile is at a valley. Thus, the Doppler spread determines 
the rate of traversal across the interference pattern and is inversely proportional to the 
coherence time of the channel. 

We now see why we have partially ignored the denominator terms in (2.11) and 
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(2.13). When the difference in the length between two paths changes by a quarter 
wavelength, the phase difference between the responses on the two paths changes by 
7t/2, which causes a very significant change in the overall received amplitude. Since 
the carrier wavelength is very small relative to the path lengths, the time over which 
this phase effect causes a significant change is far smaller than the time over which the 
denominator terms cause a significant change. The effect of the phase changes is on 
the order of milliseconds, whereas the effect of changes in the denominator are of the 
order of seconds or minutes. In terms of modulation and detection, the time scales of 
interest are in the range of milliseconds and less, and the denominators are effectively 
constant over these periods. 

The reader might notice that we are constantly making approximations in trying to 
understand wireless communications, much more so than for wired communications. 
This is partly because wired channels are typically time-invariant over a very long 
time-scale, while wireless channels are typically time varying, and appropriate models 
depend very much on the time scales of interest. For wireless systems, the most impor- 
tant issue is what approximations to make. Solving and manipulating equations is far 
less important. Thus, it is important to understand these modeling issues thoroughly. 

2.1.5 Reflection from a Ground Plane 

Consider a transmitting and a receive antenna, both above a plane surface such as 
a road (see Figure 2.6). When the horizontal distance r between the antennas be- 
comes very large relative to their vertical displacements from the ground plane (i.e., 
height), a very surprising thing happens. In particular, the difference between the 
direct path length and the reflected path length goes to zero as r _1 with increasing r 
(see Exercise 2.5). 

Sending 




Figure 2.6: Illustration of a direct path and a reflected path off a ground plane 

When r is large enough, this difference between the path lengths becomes small 
relative to the wavelength c/f. Since the sign of the electric field is reversed on the 
reflected path, these two waves start to cancel each other out. The electric wave at 
the receiver is then attenuated as r -2 , and the received power decreases as r~ 4 . This 




Tse and Viswanath: Fundamentals of Wireless Communication 



24 



situation is particularly important in rural areas where base stations tend to be placed 
on roads 4 . 

2.1.6 Power Decay with Distance and Shadowing 

The previous example with reflection from a ground plane suggests that the received 
power can decrease with distance faster than r~ 2 in the presence of disturbances to 
free space. In practice, there are several obstacles between the transmitter and the 
receiver and, further, the obstacles might also absorb some power while scattering the 
rest. Thus, one expects the power decay to be considerably faster than r~ 2 . Indeed, 
empirical evidence from experimental field studies suggests that while power decay 
near the transmitter is like r -2 , at large distances the power decays exponentially with 
distance. 

The ray tracing approach used so far provides a high degree of numerical accuracy 
in determining the electric field at the receiver, but requires a precise physical model 
including the location of the obstacles. But here, we are only looking for the order of 
decay of power with distance and can consider an alternative approach. So we look 
for a model of the physical environment with the fewest number of parameters but 
one that still provides useful global information about the field properties. A simple 
probabilistic model with two parameters of the physical environment: the density of 
the obstacles and the nature of the obstacles (scatterer or absorber) is developed in 
Exercise 2.6. With each obstacle absorbing a positive fraction of the energy impinging 
on it, the model allows us to show that the power decays exponentially in distance at 
a rate that is proportional to the density of the obstacles. 

With a limit on the transmit power (either at the base station or at the mobile) 
the largest distance between the base station and a mobile at which communication 
can reliably take place is called the coverage of the cell. For reliable communication, 
a minimal received power level has to be met and thus the fast decay of power with 
distance constrains cell coverage. On the other hand, rapid signal attenuation with 
distance is also helpful; it reduces the interference between adjacent cells. As cellular 
systems become more popular, however, the major determinant of cell size is the 
number of mobiles in the cell. In engineering jargon, the cell is said to be capacity 
limited instead of coverage limited. The size of cells has been steadily decreasing, 
and one talks of micro cells and pico cells as a response to this effect. With capacity 
limited cells, the inter-cell interference may be intolerably high. To alleviate the inter- 
cell interference, neighboring cells use different parts of the frequency spectrum, and 
frequency is reused at cells that are far enough. Rapid signal attenuation with distance 
allows frequencies to be reused at closer distances. 

4 Since the ground plane is modeled as a perfect scatterer (i.e., there is no loss of energy in scat- 
tering), there are other receiver positions where the power decays slower than r~ 2 . 
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The density of obstacles between the transmit and receive antennas depends very 
much on the physical environment. For example, outdoor plains have very little by 
way of obstacles while indoor environments pose many obstacles. This randomness in 
the environment is captured by modeling the density of obstacles and their absorption 
behavior as random numbers; the overall phenomenon is called shadowing 5 . The effect 
of shadow fading differs from multipath fading in an important way. The duration of a 
shadow fade lasts for multiple seconds or minutes, and hence occurs at a much slower 
time-scale compared to multipath fading. 

2.1.7 Moving Antenna, Multiple Reflectors 

Dealing with multiple reflectors, using the technique of ray tracing, is in principle 
simply a matter of modeling the received waveform as the sum of the responses from 
the different paths rather than just two paths. We have seen enough examples, however, 
to understand that finding the magnitude and phase of these responses is no simple 
task. Even for the very simple large wall example in Figure 2.2, the reflected field 
calculated in (2.6) is valid only at distances from the wall that are small relative to the 
dimensions of the wall. At very large distances, the total power reflected from the wall 
is proportional to both d~ 2 and to the area of the cross section of the wall. The power 
reaching the receiver is proportional to ( d — r(t))~ 2 . Thus, the power attenuation from 
transmitter to receiver (for the large distance case) is proportional to (d(d — r(t)))~ 2 
rather than to (2d — r(f)) -2 . This shows that ray tracing must be used with some 
caution. Fortunately, however, linearity still holds in these more complex cases. 

Another type of reflection is known as scattering and can occur in the atmosphere or 
in reflections from very rough objects. Here there are a very large number of individual 
paths, and the received waveform is better modeled as an integral over paths with 
infinitesimally small differences in their lengths, rather than as a sum. 

Knowing how to find the amplitude of the reflected field from each type of re- 
flector is helpful in determining the coverage of a base station (although, ultimately 
experimentation is necessary). This is an important topic if our objective is trying to 
determine where to place base stations. Studying this in more depth, however, would 
take us afield and too far into electromagnetic theory. In addition, we are primarily in- 
terested in questions of modulation, detection, multiple access, and network protocols 
rather than location of base stations. Thus, we turn our attention to understanding 
the nature of the aggregate received waveform, given a representation for each reflected 
wave. This leads to modeling the input/output behavior of a channel rather than the 
detailed response on each path. 



5 This is called shadowing because it is similar to the effect of clouds partly blocking sunlight. 
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2.2 Input/Output Model of the Wireless Channel 

We derive an input /output model in this section. We first show that the multipath 
effects can be modeled as a linear time varying system. We then obtain a baseband 
representation of this model. The continuous-time channel is then sampled to obtain 
a discrete-time model. Finally we incorporate additive noise. 



2.2.1 The Wireless Channel as a Linear Time- Varying System 

In the previous section we focussed on the response to the sinusoidal input 0(0) = 
cos 27r/f. The received signal can be written as Gq(/, f)0(f — Tj(/, t )), where a;(/, t ) 
and Tj(/, t) are respectively the overall attenuation and propagation delay at time t 
from the transmitter to the receiver on path i. The overall attenuation is simply the 
product of the attenuation factors due to the antenna pattern of the transmitter and 
the receiver, the nature of the reflector, as well as a factor that is a function of the 
distance from the transmitting antenna to the reflector and from the reflector to the 
receive antenna. We have described the channel effect at a particular frequency /. If 
we further assume that the a*(/, t)’s and the Tj (/,£)’ s do not depend on the frequency 
/, then we can use the principle of superposition to generalize the above input-output 
relation to an arbitrary input x(t) with nonzero bandwidth: 



y{t) = ^ cii{t)x(t - nit)). 

i 



(2.14) 



In practice the attenuations and the propagation delays are usually slowly varying 
functions of frequency. These variations follow from the time-varying path lengths and 
also from frequency dependent antenna gains. However, we are primarily interested 
in transmitting over bands that are narrow relative to the carrier frequency, and over 
such ranges we can omit this frequency dependence. It should however be noted that 
although the individual attenuations and delays are assumed to be independent of the 
frequency, the overall channel response can still vary with frequency due to the fact 
that different paths have different delays. 

For the example of a perfectly reflecting wall in Figure 2.4, then, 



n (t) 



ai(t) = 

r 0 + vt 
c 



Fl 

r 0 + vt ’ 

Z0i 



2tt/’ 



a 2 (t) 
F2 (t) 



FI 

2d — r 0 — vt ’ 

2d — r 0 — vt Z0 2 
c 2ir S' 



(2.15) 

(2.16) 



where the first expression is for the direct path and the second for the reflected path. 
The term Z(pj here is to account for possible phase changes at the transmitter, reflector, 
and receiver. For the example here, there is a phase reversal at the reflector so we take 
0i = 0 and 02 = 7T. 
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Since the channel (2.14) is linear, it can be described by the response h(r,t ) at 
time t to an impulse transmitted at time t — r. In terms of h(r,t ), the input-output 
relationship is given by: 

/ OO 

h(r, t)x(t — r)dr. (2-17) 

-OO 

Comparing (2.17) and (2.14), we see that the impulse response for the fading multipath 
channel is: 

h(r,t) = y ^aj(t)S(r - Tj(t)). (2.18) 

i 

This expression is really quite nice. It says that the effect of mobile users, arbitrarily 
moving reflectors and absorbers, and all of the complexities of solving Maxwell’s equa- 
tions, finally reduce to an input /output relation between transmit and receive antennas 
which is simply represented as the impulse response of a linear time-varying channel 
filter. 

The effect of the Doppler shift is not immediately evident in this representation. 
From (2.16) for the single reflecting wall example, t' ( t) = vi/c where v l is the velocity 
with which the i th path length is increasing. Thus, the Doppler shift on the i th path is 

In the special case when the transmitter, receiver and the environment are all 
stationary, the attenuations a*(t)’ s and propagation delays Tj(t)’s do not depend on 
time t, and we have the usual linear time-invariant channel with an impulse response 

h(r) = ^2 a iS(T - Ti). (2.19) 

i 

For the time- varying impulse response h(r, t), we can define a time- varying fre- 
quency response 



/ OO 

h(r, t)e~ j2nfr dT = ai(t)e~ j2nfTi ^ . (2.20) 

-OO j 

In the special case when the channel is time-invariant, this reduces to the usual fre- 
quency response. One way of interpreting H(f-,t) is to think of the system as a slowly 
varying function of t with a frequency response H(f; t) at each fixed time t. Corre- 
sponding, h(r, t) can be thought of as the impulse response of the system at a fixed 
time t. This is a legitimate and useful way of thinking about multipath fading chan- 
nels, as the time-scale at which the channel varies is typically much longer than the 
delay spread of the impulse response at a fixed time. In the reflecting wall example in 
Section 2.1.4, the time taken for the channel to change significantly is of the order of 
milliseconds while the delay spread is of the order of microseconds. Fading channels 
which have this characteristic are sometimes called underspread channels. 
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Figure 2.7: Illustration of the relationship between a passband spectrum S(f) and its 
baseband equivalent S7(/). 



2.2.2 Baseband Equivalent Model 

In typical wireless applications, communication occurs in a passband [f c — y • f c + y] 
of bandwidth W around a center frequency / c , the spectrum having been specified 
by regulatory authorities. However, most of the processing, such as coding/decoding, 
modulation/demodulation, synchronization, etc, is actually done at the baseband. At 
the transmitter, the last stage of the operation is to “up-convert” the signal to the 
carrier frequency and transmit it via the antenna. Similarly, the first step at the 
receiver is to “down-convert” the RF (radio-frequency) signal to the baseband before 
further processing. Therefore from a communication system design point of view, it is 
most useful to have a baseband equivalent representation of the system. We first start 
with defining the baseband equivalent representation of signals. 

Consider a real signal s(t) with Fourier transform S(f), bandlimited in [f c — 
W/2,f c + W/ 2] with W < 2 f c . Define its complex baseband equivalent s 6 (t ) as the 
signal having Fourier transform: 



W) 



V2S(f + f c ) f + f c > 0 

0 f + fc< 0 ' 



( 2 . 21 ) 



Since s(t) is real, its Fourier transform is Hcrmitian around / = 0, which means that 
Sb(t) contains exactly the same information as s(t). The factor of \/2 is quite arbitrary 
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3?[s b (t)] 



\/2 cos 2n f c t 



s(t) 



— TV 
2 








Figure 2.8: Illustration of upconversion from Sb(t) to s(t) and followed by downcon- 
version from s(t) back to s&(t). 



bnt chosen to normalize the energies of s 6 (t) and s(t) to be the same. Note that Sb(t) 
is bandlimited in [— W/2, W/2], See Figure 2.7. 

To reconstruct s(t) from s&(t), we observe that: 

V2S(f) = S b (f - f c ) + S' b (-f - f c ). (2.22) 

Taking inverse Fourier transforms, we get 

s(t) = 4 = {s b (t)e j2nfct + s* b (t)e- j27rfct } = V2M [s b {t)e j27rfct ] . (2.23) 

V2 

In terms of real signals, the relationship between s(t) and Sb{t) is shown in Figure 
2.8. The passband signal s(t) is obtained by modulating by v^2cos27r f c t 

and S[s&(£)] by — \/2 sin 2nf c t and summing, to get 3? [sb(t)e^ 277 ^ ct ] (up-conversion). 
The baseband signal 9£[si,(t)] (respectively A[s{,(f)]) is obtained by modulating s(t) 
by \/2 cos 2nf c t (respectively y/2 sin 2nf c t) followed by ideal low-pass filtering at the 
baseband [— W/2, W/2] (down-conversion). 

Let ns now go back to the multipath fading channel (2.14) with impulse response 
given by (2.18). Let Xbit) and yb(t) be the complex baseband equivalents of the trans- 
mitted signal x(t) and the received signal y(t), respectively. Figure 2.9 shows the 
system diagram from Xb{t) to yb(t). This implementation of a passband communica- 
tion system is known as quadrature amplitude modulation (QAM). The signal $ft[x&(£)] 
is sometimes called the in-phase component, I, and 3f[xj,(£)] the quadrature component, 








Tse and Viswanath: Fundamentals of Wireless Communication 



30 



Q, (rotated by n/2.) We now calculate the baseband equivalent channel. Substituting 
x(t) = \/2 lfc[x b {t)e : >‘ 2 ' K f ct } and y(t) = y^2^t[y b (t)e j27r ^ ct ] into (2.14) we get: 

& [y b {t)e^] = Y W* - , 



= 3 ? 




ai(t)x b (t - i i(t))e- j2nfcTi ® l e j2nfct 



(2.24) 



Similarly, one can obtain (see Exercise 2.14) 



ybity 2 ^] = a 




a r (t)x b (t - Ti(t))e- j2nfcTiW l e j2nfct 



(2.25) 



Hence, the baseband equivalent channel is: 

Vb(t) = Y ( 2 - 26 ) 

i 

where 

a^t) := Oi(t)e- j27rfcTi ^. (2.27) 

The input-output relationship in (2.26) is also that of a linear time-varying system, 
and the baseband equivalent impulse response is: 

h{r, t) = Y a i( t ) 5 ( r ~ ( 2 - 28 ) 



This representation is easy to interpret in the time domain, where the effect of the 
carrier frequency can be seen explicitly. The baseband output is the sum, over each 
path, of the delayed replicas of the baseband input. The magnitude of the i th such 
term is the magnitude of the response on the given path; this changes slowly, with 
significant changes occurring on the order of seconds or more. The phase is changed 
by 7t/2 (i.e., is changed significantly) when the delay on the path changes by l/(4/ c ), 
or equivalently, when the path length changes by a quarter wavelength, i.e., by c/ (4 f c ). 
If the path length is changing at velocity v, the time required for such a phase change 
is c/(4/ c u). Recalling that the Doppler shift D at frequency / is fv/c, and noting that 
/ ~ / c for narrow band communication, the time required for a n/2 phase change is 
1/(4 D). For the single reflecting wall example, this is about 5 ms (assuming f c = 900 
MHz and v = 60 km/h). The phases of both paths are rotating at this rate but in 
opposite directions. 

Note that the Fourier transform H b (f;t ) of h b (r,t) for a fixed t is simply H(f + 
f c \t ), i.e., the frequency response of the original system (at a fixed t) shifted by the 
carrier frequency. This provides another way of thinking about the baseband equivalent 
channel. 
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Figure 2.9: System diagram from the baseband transmitted signal Xb(t) to the baseband 
received signal yb(t). 

2.2.3 A Discrete Time Baseband Model 

The next step in creating a useful channel model is to convert the continuous time 
channel to a discrete time channel. We take the usual approach of the sampling 
theorem. Assume that the input waveform x(t) is bandlimited to W. The baseband 
equivalent is then limited to W/2 and can be represented as 

x b (t) = x[n] sine (ITT — n), (2.29) 

n 

where x[n] is given by Xb(n/W) and sinc(f) is defined as: 

sinc(i) := si ‘A!l. (2.30) 

Tit 

This representation follows from the sampling theorem, which says that any waveform 
bandlimited to W/2 can be expanded in terms of the orthogonal basis {sinc(Wf — n)} n , 
with coefficients given by the samples (taken uniformly at integer multiples of 1/W). 
Using (2.26), the baseband output is given by 

Ub(t) = x[n\ d\{t ) sine (Wt — Wr^t) — n) . (2-31) 

n i 

The sampled outputs at multiples of 1/W, y[m] := yb(m/W), are then given by 

y[m\ = (m/W)sinc[m — n — Ti(m/W)W}. (2.32) 

n i 
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The sampled output y[m] can equivalently be thought as the projection of the waveform 
Db{t) onto the waveform W sinc(WT — m). Let £ :— m — n. Then 

y[m] = E x[m — £] a\{m/W ) sine [l — Tj{m/W)W]. (2.33) 

£ i 



By defining 



h([m\ : = E a\{m/W) sinc[£ - Ti(m/W)W], (2.34) 

i 



(2.33) can be written in the simple form 

y[m\ = E he[m] x[m — £\. (2.35) 

i 

We denote hi[m] as the I th (complex) channel filter tap at time m. Its value is a 
function of mainly the gains a\{t) of the paths, whose delays Tj(t) are close to tjW 
(Figure 2.10). In the special case where the gains a^(t)’s and the delays Tj(t)’s of the 
paths are time-invariant, (2.34) simplifies to: 

hi = sinc[l TjW], (2.36) 

i 

and the channel is linear time-invariant. The £ th tap can be interpreted as samples of 
the low-pass filtered baseband channel response hb(r) (c.f. (2.19)): 

h e = (h b *8mc)(£/W). (2.37) 

where * is the convolution operation. 

We can interpret the sampling operation as modulation and demodulation in a 
communication system. At time n, we are modulating the complex symbol x[n] (in- 
phase plus quadrature components) by the sine pulse before the up-conversion. At the 
receiver, the received signal is sampled at times m/W at the output of the low-pass 
filter. Figure 2.11 shows the complete system. In practice, other transmit pulses, such 
as the raised cosine pulse, are often used in place of the sine pulse, which has rather 
poor time-decay property and tends to be more susceptible to timing errors. This 
necessitates sampling at a rate below the Nyquist sampling rate, but does not alter 
the essential nature of the following descriptions. Hence we will confine to Nyquist 
sampling. 

Due to the Doppler spread, the bandwidth of the output yb(t) is generally slightly 
larger than the bandwidth W / 2 of the input Xb(t), and thus the output samples {y[m\} 
do not fully represent the output waveform. This problem is usually ignored in practice, 
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1 

W 




Main contribution -1 = 0 



Main contribution -1 = 0 



Main contribution -1 = 1 



Main contribution -1 = 2 



Main contribution -1 = 2 



Figure 2.10: Due to the decay of the sine function, the i th path contributes most 
significantly to the £ th tap if its delay falls in the window [£/W — 1/(2 W),£/W + 
1/(2 W)\. 
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5Rp[m]] 



inc (Wt — m) 



5R[x 6 (t) 




\/2 cos 2irf c t 





0[a;[m]] 






*4in c(Wt — m) [ 




vd) 




Figure 2.11: System diagram from the baseband transmitted symbol x[m] to the base- 
band sampled received signal y[m]. 



since the Doppler spread is small (of the order of lO’s-lOO’s of Hz) compared to the 
bandwidth W. Also, it is very convenient for the sampling rate of the input and output 
to be the same. Alternatively, it would be possible to sample the output at twice the 
rate of the input. This would recapture all the information in the received waveform. 
The number of taps would be almost doubled because of the reduced sample interval, 
but it would typically be somewhat less than doubled since the representation would 
not spread the path delays so much. 



Discussion 2.1: Degrees of Freedom 

The symbol x[m] is the m th sample of the transmitted signal; there are W 
samples per second. Each symbol is a complex number; we say that it represents 
one (complex) dimension or degree of freedom. The continuous time signal x(t) of 
duration one second corresponds to W discrete symbols; thus we could say that 
the bandlimited continuous time signal has W degrees of freedom per second. 

The mathematical justification for this interpretation comes from the following 
important result in communication theory: the signal space of complex continuous 
time signals of duration T which have most of their energy within the frequency 
band [— W/2,W/2] has dimension approximately WT. (A precise statement of 
this result can be found in [?].) This result reinforces our interpretation that a 
continuous time signal with bandwidth W can be represented by W complex 
dimensions per second. 

The received signal y(t) is also bandlimited to approximately W (due to the 
Doppler spread, the bandwidth is slightly larger than W ) and has W complex 










Tse and Viswanath: Fundamentals of Wireless Communication 



35 



dimensions per second. From the point of view of communication over the 
channel, the received signal space is what matters because it dictates the number 
of different signals which can be reliably distinguished at the receiver. Thus, we 
define the degrees of freedom of the channel to be the dimension of the received 
signal space, and whenever we refer to the signal space, we implicitly mean the 
received signal space unless stated otherwise. 



2.2.4 Additive White Noise 

As a last step, we include additive noise in our input/output model. We make the 
standard assumption that w(t) is zero-mean additive white Gaussian noise (AWGN) 
with power spectral density N 0 /2 (i.e., E[w(0)w(t)] = S(t )). The model (2.14) is 

now modified to be: 

y(t) = ^2 a.i(t)x(t - Ti(t )) + w(t). (2.38) 

i 

See Figure 2.12. The discrete-time baseband-equivalent model (2.35) now becomes 

y[m] = E h([m]x[m — i] + w[m], (2.39) 

i 



where w\m] is the low-pass filtered noise at the sampling instant m/W. Just like the 
signal, the white noise w(t) is down-converted, filtered at the baseband and ideally 
sampled. Thus, it can be verified (see Exercise 2.12) that 





3 ft(w[m]) = j 


roo 

1 w(t)i> m)1 (t)dt, 

— oo 


(2.40) 




A(w[m]) = j 


r*oo 

1 w(t)if m , 2 (t)dt, 

— OO 


(2.41) 


where 




:= V2W cos(2nf c t) sine (Wt—m), 


<t>m,2(t) := -\/2W sin 


( 2nf c t ) , sine (Wt—m 
(2.42) 



It can further be shown that {'f’mnft), forms an orthonormal set of waveforms, 

i.e., the waveforms are orthogonal to each other. (See Exercise 2.13.) In Appendix A 
we review the definition and basic properties of white Gaussian random vectors (i.e., 
vectors whose components are independent and identically distributed (i.i.d.) Gaussian 
random variables.) A key property is that the projections of a white Gaussian random 
vector onto any orthonormal vectors are independent and identically distributed Gaus- 
sian random variables. Heuristically, one can think of continuous-time Gaussian white 
noise as an infinite-dimensional white random vector and the above property carries 
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Figure 2.12: A complete system diagram. 

through: the projections onto orthogonal waveforms are uncorrelated and hence inde- 
pendent. Hence the discrete-time noise process {w[m]} is white, i.e., independent over 
time, moreover, the real and imaginary components are i.i.d. Gaussians with variances 
Nq/2. A complex Gaussian random variable X whose real and imaginary components 
are i.i.d. satisfies a circular symmetry property: e^X has the same distribution as 
X for any (ft . We shall call such a random variable circular symmetric complex Gaus- 
sian, denoted by CJ\f(0,a 2 ), where a 2 = A[|A"| 2 ]. The concept of circular symmetry is 
discussed further in Section A. 1.3 of Appendix A. 

The assumption of AWGN essentially means that we are assuming that the primary 
source of the noise is at the receiver or is radiation impinging on the receiver that is 
independent of the paths over which the signal is being received. This is normally a 
very good assumption for most communication situations. 

2.3 Time and Frequency Coherence 

2.3.1 Doppler Spread and Coherence Time 

An important channel parameter is the time-scale of the variation of the channel. How 
fast do the taps h([m\ vary as a function of time ml Recall that 

hg[m] = E a\{m/W) sine [£ — Ti(m/W)W ] , 

i 

= J2 a i( m /W ) e _J,27r/cTi (m/ VK) sine [£ - n{m/W)W } . (2.43) 
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Let us look at this expression term by term. From Section 2.2.2 we gather that signif- 
icant changes in a* occur over periods of seconds or more. Significant changes in the 
phase of the i th path occur at intervals of 1/(4 Df), where Di = / c r ? - (f) is the Doppler 
shift for that path. When the different paths contributing to the £ th tap have different 
Doppler shifts, the magnitude of h(\m] changes significantly. This is happening at the 
time-scale inversely proportional to the largest difference between the Doppler shifts, 
the Doppler spread D s : 



D s := max f c \r-(t) 

hj 



m i, 



(2.44) 



where the maximum is taken over all the paths that contribute significantly to a 
tap. 6 Typical intervals for such changes are on the order of 10 ms. Finally, changes 
in the sine term of (2.43) due to the time variation of each r*(t) are proportional to 
the bandwidth, whereas those in the phase are proportional to the carrier frequency, 
which is much larger. Essentially, it takes much longer for a path to move from one 
tap to the next than for its phase to change significantly. Thus, the fastest changes in 
the filter taps occur because of the phase changes, and these are significant over delay 
changes of 1/(4 D s ). 

The coherence time, T c , of a wireless channel is defined (in an order of magnitude 
sense) as the interval over which ht [m] changes significantly as a function of m. What 
we have found, then, is the important relation: 



1 




T — 

° 4 D a ' 


(2.45) 



This is a somewhat imprecise relation, since the largest Doppler shifts may belong 
to paths that are too weak to make a difference. We could also view a phase change of 
7t/4 to be significant, and thus replace the factor of 4 above by 8. Many people instead 
replace the factor of 4 by 1. The important thing is to recognize that the major 
effect in determining time coherence is the Doppler spread, and that the relationship 
is reciprocal; the larger the Doppler spread, the smaller the time coherence. 

In the wireless communication literature, channels are often categorized as fast 
fading and slow fading, but there is little consensus on what these terms mean. In 
this book, we will call a channel fast fading if the coherence time T c is much shorter 
than the delay requirement of the application, and slow fading if T c is longer. The 
operational significance of this definition is that in a fast fading channel, one can 
transmit the coded symbols over multiple fades of the channel, while in a slow fading 

6 The Doppler spread can in principle be different for different taps. Exercise 2.8 explores this 
possibility. 
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channel, one cannot. Thus, whether a channel is fast or slow fading depends not only 
on the environment but also on the application; voice, for example, typically has a 
short delay requirement of less than 100 ms, while some types of data applications can 
have a laxer delay requirement. 

2.3.2 Delay Spread and Coherence Bandwidth 

Another important general parameter of a wireless system is the multipath delay 
spread, T d , defined as the difference in propagation time between the longest and 
shortest path, counting only the paths with significant energy. Thus, 



T d := max | r^t) - Tj(t)\. 
hi 



(2.46) 



This is defined as a function of t, but we regard it as an order of magnitude quantity, 
like the time coherence and Doppler spread. If a cell or LAN has a linear extent of a 
few kilometers or less, it is very unlikely to have path lengths that differ by more than 
300 to 600 meters. This corresponds to path delays of one or two /as. As cells become 
smaller clue to increased cellular usage, T d also shrinks. As was already mentioned, 
typical wireless channels are underspread, which means that the delay spread T d is 
much smaller than the coherence time T c . 

The bandwidths of cellular systems range between several hundred kHz and several 
MHz, and thus, for the above multipath delay spread values, all the path delays in 
(2.34) lie within the peaks of 2 or 3 sine functions; more often, they he within a single 
peak. Adding a few extra taps to each channel Liter because of the slow decay of 
the sine function, we see that cellular channels can be represented with at most 4 or 
5 channel filter taps. On the other hand, there is a recent interest in ultrawideband 
(UWB) communication, operating from 3.1 to 10.6 GHz. These channels can have up 
to a few hundred taps. 

When we study modulation and detection for cellular systems, we shall see that the 
receiver must estimate the values of these channel filter taps. The taps are estimated 
via transmitted and received waveforms, and thus the receiver makes no explicit use 
of (and usually does not have) any information about individual path delays and path 
strengths. This is why we have not studied the details of propagation over multiple 
paths with complicated types of reflection mechanisms. All we really need is the 
aggregate values of gross physical mechanisms such as Doppler spread, coherence time, 
and multipath spread. 

The delay spread of the channel dictates its frequency coherence. Wireless channels 
change both in time and frequency. The time coherence shows us how quickly the 
channel changes in time, and similarly, the frequency coherence shows how quickly 
it changes in frequency. We first understood about channels changing in time, and 
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correspondingly about the duration of fades, by studying the simple example of a direct 
path and a single reflected path. That same example also showed us how channels 
change with frequency. We can see this in terms of the frequency response as well. 

Recall that the frequency response at time t is 

H(f;t) = '£a i (t)e- j2 * fTi ®. (2.47) 

i 

The contribution due to a particular path has linear phase in /. For multiple paths, 
there is a differential phase, 2n firflt) — Tkit)). This differential phase causes selective 
fading in frequency. This says that E r (f,t ) changes significantly, not only when t 
changes by 1/(4 D), but also when / changes by 1/2 T d . This argument extends to an 
arbitrary number of paths, so the coherence bandwidth , W c , is given by 



1 




w c = — . 

2 T d 


(2.48) 



This relationship, like (2.45), is intended as an order of magnitude relation, essen- 
tially pointing out that the coherence bandwidth is reciprocal to the multipath spread. 
When the bandwidth of the input is considerably less than W c , the channel is usually 
referred to as flat fading. In this case, the delay spread T d is much less than the symbol 
time 1 /W , and a single channel filter tap is sufficient to represent the channel. When 
the bandwidth is much larger than W c , the channel is said to be frequency-selective, and 
it has to be represented by multiple taps. Note that flat or frequency-selective fading 
is not a property of the channel alone, but of the relationship between the bandwidth 
W and the coherence bandwidth T d (Figure 2.13). 

The physical parameters and the time scale of change of key parameters of the 
discrete-time baseband channel model are summarized in Table 2.1. The different 
types of channels are summarized in Table 2.2. 
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Figure 2.13: (a) A channel over 200 MHz is frequency-selective, and the impulse re- 
sponse has many taps, (b) The spectral content of the same channel, (c) The same 
channel over 40 MHz is flatter, and has much fewer taps, (d) The spectral contents 
of the same channel, limited to 40 MHz bandwidth. At larger bandwidths, the same 
physical paths are resolved into a hirer resolution. 



Key Channel Parameters and Time Scales 


Symbol 


Representative Values 


carrier frequency 


fc 




1 GHz 


communication bandwidth 


W 




1 MHz 


distance between transmitter and receiver 


d 




1 km 


velocity of mobile 


V 




64 km/h 


Doppler shift for a path 


D = 


fcV 

c 


50 Hz 


Doppler spread of paths corresponding to a tap 


D s 




100 Hz 


time scale for change of path amplitude 


d 

V 




1 minute 


time scale for change of path phase 


1 

4 D 




5 ms 


time scale for a path to move over a tap 


c 

vW 




20 s 


coherence time 


T c = 


1 

4 D s 


2.5 ms 


delay spread 


T d 




1 /iS 


coherence bandwidth 


w c = 


1 

" 2 T d 


500 kHz 



Table 2.1: A summary of the physical parameters of the channel and the time scale of 
change of the key parameters in its discrete-time baseband model. 
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Types of Channel 


Defining Characteristic 


fast fading 
slow fading 
flat fading 

frequency-selective fading 
underspread 


T c -C delay requirement 
T c S> delay requirement 
W < W c 
W > W c 
T d < T c 



Table 2.2: A summary of the types of wireless channels and their defining characteris- 
tics. 

2.4 Statistical Channel Models 

2.4.1 Modeling Philosophy 

We defined Doppler spread and multipath spread in the previous section as quantities 
associated with a given receiver at a given location, velocity, and time. However, we 
are interested in a characterization that is valid over some range of conditions. That 
is, we recognize that the channel filter taps, {hg[m]} must be measured, but we want 
a statistical characterization of how many taps are necessary and how quickly they 
change. 

Such a characterization requires a probabilistic model of the channel tap values, 
perhaps gathered by statistical measurements of the channel. We are familiar with 
describing additive noise by such a probabilistic model (as a Gaussian random vari- 
able). We are also familiar with evaluating error probability while communicating over 
a channel using such models. These error probability evaluations, however, depend 
critically on the independence and Gaussian distribution of the noise variables. 

It should be clear from the description of the physical mechanisms generating 
Doppler spread and multipath spread that probabilistic models for the channel fil- 
ter taps are going to be far less believable than the models for additive noise. On the 
other hand, we need such models, even if they are quite inaccurate. Without models, 
systems are designed using experience and experimentation, and creativity becomes 
somewhat stifled. Even with highly over-simplified models, we can compare different 
system approaches and get a sense of what types of approaches are worth pursuing. 

To a certain extent, all analytical work is done with simplified models. For example, 
white Gaussian noise (WGN) is often assumed in communication models, although we 
know the model is valid only over sufficiently small frequency bands. With WGN, 
however, we expect the model to be quite good when used properly. For wireless 
channel models, however, probabilistic models are quite poor and only provide order- 
of-magnitude guides to system design and performance. We will see that we can define 
Doppler spread, multipath spread, etc. much more cleanly with probabilistic models, 
but the underlying problem remains that these channels are very different from each 
other and cannot really be characterized by probabilistic models. At the same time, 
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there is a large literature based on probabilistic models for wireless channels, and it has 
been highly useful for providing insight into wireless systems. However, it is important 
to understand the robustness of results based on these models. 

There is another question in deciding what to model. Recall the continuous-time 
multipath fading channel 



y(t) = ^2 a i( t ) x (t ~ + w(t). (2.49) 

i 

This contains an exact specification of the delay and magnitude of each path. From 
this, we derived a discrete time baseband model in terms of channel filter taps as 

y[m] = E h([m]x[m — &\ + w[m], (2.50) 

i 

where 

h([m] = J2 ai{m/W)e- j27vfcTi ^ w ^ sine [i - T^m/W)W} . (2.51) 

i 

We used the sampling theorem expansion in which x[m] = Xb(m/W ) and y[m] = 
yb(m/W). Each channel tap h([m\ contains an aggregate of paths, with the delays 
smoothed out by the baseband signal bandwidth. 

Fortunately, it is the filter taps that must be modeled for input /output descriptions, 
and also fortunately, the filter taps often contain a sufficient path aggregation so that 
a statistical model might have a chance of success. 

2.4.2 Rayleigh and Rician Fading 

The simplest probabilistic model for the channel filter taps is based on the assumption 
that there are a large number of statistically independent reflected and scattered paths 
with random amplitudes in the delay window corresponding to a single tap. The phase 
of i th path is 2irf c Ti modulo 27 t. Now, / c T; = rfj/A, where d t is the distance travelled by 
the i th path and A is the carrier wavelength. Since the reflectors and scatterers are far 
away relative to the carrier wavelength, i.e., di 3> A, it is reasonable to assume that the 
phase for each path is uniformly distributed between 0 and 27 t and that the phases of 
different paths are independent. The contribution of each path in the tap gain h( [m] 
is 

a i {m/W)e~ j2vf ^ m/W) sine [i - r^m/W)W} (2.52) 

and this can be modeled as a circular symmetric complex random variable. 7 Each 
tap h([m\ is the sum of a large number of such small independent circular symmetric 
random variables. It follows that 3 ft(/i^[m]) is the sum of many small independent real 

7 See Section A. 1.3 in Appendix A for a more in-depth discussion of circular symmetric random 
variables and vectors. 
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random variables, and so by the Central Limit Theorem, it can reasonably be modeled 
as a zero-mean Gaussian random variable. Similarly, because of the uniform phase, 
1k{h([m\e^) is Gaussian with the same variance for any fixed <f>. This assures ns that 
h([m\ is in fact circular symmetric CA/"(0, <7 % ) (see Section A. 1.3 in Appendix A for an 
elaboration). It is assumed here that the variance of hgfm ] is a function of the tap £, 
but independent of time m (there is little point in creating a probabilistic model that 
depends on time). With this assumed Gaussian probability density, we know that the 
magnitude |/i^[m]| of the tap is a Rayleigh random variable with density (c.f. (A. 20) 
in Appendix A and Exercise 2.15): 



x 





x > 0, 



(2.53) 



and the squared magnitude h( [m] | 2 is exponentially distributed with density: 



exp | — ^ 1 , x > 0. (2.54) 

<n l n) 



This model, which is called Rayleigh fading, is quite reasonable for scattering 
mechanisms where there are many small reflectors, but is adopted primarily for its 
simplicity in typical cellular situations with a relatively small number of reflectors. 
The word Rayleigh is almost universally used for this model, but the assumption is 
that the tap gains are circularly symmetric complex Gaussian random variables. 

There is a frequently used alternative model in which the line of sight path (often 
called a specular path) is large and has a known magnitude, and that there are also a 
large number of independent paths. In this case, he\m], at least for one value of £, can 
be modeled as: 



h([m = 



n 



k + 1 



a^ 6 + 



k + 1 



CAf(0, 



(Jo 



(2.55) 



with the first term corresponding to the specular path arriving with uniform phase 9 
and the second term corresponding to the aggregation of the large number of reflected 
and scattered paths, independent of 9. The parameter k (so-called A'-factor) is the 
ratio of the energy in the specular path to the energy in the scattered paths; the larger 
k is, the more deterministic is the channel. The magnitude of such a random variable 
is said to have a Rician distribution. Its density has quite a complicated form; it is 
often a better model of fading than the Rayleigh model. 
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2.4.3 Tap Gain Autocorrelation Function 

Modeling each h( [m] as a complex random variable provides part of the statistical 
description that we need, but this is not the most important part. The more important 
issue is how these quantities vary with time. As we will see in the rest of the book, the 
rate of channel variation has significant impact on several aspects of the communication 
problem. A statistical quantity that models this relationship is known as the tap gain 
autocorrelation function, R([n}. ft is defined as 

R([n] := E {hf[m\h(\m + n]} . (2.56) 

For each tap l, this gives the autocorrelation function of the sequence of random 
variables modeling that tap as it evolves in time. We are tacitly assuming that this is 
not a function of time m. Since the sequence of random variables {hi[m\} for any given 
£ has both a mean and covariance function that does not depend on m, this sequence is 
wide sense stationary. We also assume that, as a random variable, h([m ] is independent 
of hfi [m 1 ] for all £ ^ £' and all m,m' . This final assumption is intuitively plausible 8 
since paths in different ranges of delay contribute to h([m\ for different values of £. 

The coefficient Rt [0] is proportional to the energy received in the £ th tap. The 
multipath spread can be defined as the product of 1/W times the range of £ which 
contains most of the total energy ^[0] This is somewhat preferable to our pre- 
vious “definition” in that the statistical nature of becomes explicit and the reliance 
on some sort of stationarity becomes explicit. Now, we can also define the coherence 
time T c more explicitly as the smallest value of n > 0 for which R([n\ is significantly 
different from R,t [0] . With both of these definitions, we still have the ambiguity of 
what ‘significant’ means, but we are now facing the reality that these quantities must 
be viewed as statistics rather than as instantaneous values. 

The tap gain autocorrelation function is useful as a way of expressing the statistics 
for how tap gains change given a particular bandwidth W, but gives little insight 
into questions related to choice of a bandwidth for communication. If we visualize 
increasing the bandwidth, we can see several things happening. First, the ranges of 
delay that are separated into different taps £ become narrower {1/W seconds), so 
there are fewer paths corresponding to each tap, and thus the Rayleigh approximation 
becomes poorer. Second, the sine functions of (2.51) become narrower, and /b>[0] gives 
a finer grained picture of the amount of power being received in the £ th delay window 
of width 1/W . In summary, as we try to apply this model to larger W, we get more 
detailed information about delay and correlation at that delay, but the information 
becomes more questionable. 



Example 2.2: Clarke’s M odel 

8 One could argue that a moving reflector would gradually travel from the range of one tap to 
another, but as we have seen, this typically happens over a very large time-scale. 
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R 0 [n\ S(f ) 




Figure 2.15: Plots of the autocorrelation function and Doppler spectrum in Clarke’s 
model. 



This is a popular statistical model for flat fading. The transmitter is fixed, the 
mobile receiver is moving at speed v, and the transmitted signal is scattered by 
stationary objects around the mobile. The scattered path arriving at the mobile 
at the angle 9 with respect to the direction of motion has a delay of T$(t) and a 
time-invariant gain a#, and the input-output relationship is given by: 

p2i r 

y(t) = / a e x(t - r 0 (t))d9 
Jo 

Note that there is a continuum of paths: a slight generalization of the finite-path 
model we have been considering. 

The most general version of the model allows the received power distribution 
p(9) and the antenna gain pattern a (9) to be arbitrary functions of the angle 9, 
but the most common scenario assumes uniform power distribution and isotropic 
antenna gain pattern, i.e. , the amplitudes ag = a for all angles 9. This models the 
situation when the scatterers are located in a ring around the mobile (Figure 
2.14). 

Suppose the communication bandwidth W is much smaller than the reciprocal 
of the delay spread. The complex baseband channel can be represented by a single 
tap at each time: 

y[m] = ho[m]x[m] +w[m\. (2.58) 

The phase of the signal arriving at time 0 from an angle 9 is 27rf c Tg(0) mod 2n, 
where f c is the carrier frequency. Making the assumption that this phase is 



(2.57) 
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uniformly distributed in [0, 27r] and independently distributed across all angles 6, 
Exercise 2.18 shows that the process {ho[m\} is stationary, Gaussian and the 
autocorrelation function Rq [ n] is given by: 

i?o[ n ] = a 2 vrJ 0 ( nnD s /W ) (2.59) 

where Jo(-) is the 0 th -order Bessel function of the first kind: 

JJ X ) ■=- f e jxcos9 dd. (2.60) 

7T Jo 



and D s = 2 f c v/c is the Doppler spread. The power spectral density S(f), defined 
on [ — W/2, +W/2], is given by 



S(f) 



2 a 2 

B sV / l-(2//D J ) 2 

0 



-D s / 2 < / < +DJ 2 
else. 



(2.61) 



This can be verified by computing the inverse Fourier transform of (2.61) to be 
(2.59). Plots of the autocorrelation function and the spectrum for two different 
speeds are shown in Figure 2.15. We see that for faster speeds (corresponding to a 
larger Doppler spread), the autocorrelation function decays faster, and the 
spectrum has a wider spread. If we define the coherence time T c to be the value of 
n/W such that Ro[n] = 0.05i?o[0], then 

C ttDs 



(2.62) 
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i.e. , the coherence time is inversely proportional to D s . 

In Exercise 2.18, yon will also verify that S(f)df has the physical 
interpretation of the received power along paths that have Doppler shifts in the 
range [/, f + df]. Thus, S(f) is also called the Doppler spectrum. Note that S(f) 
is zero beyond the maximum Doppler shift D s j 2. 



Example 2.3: The Ultra- wideband Channel 
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Chapter 2: The Main Plot 

Large-scale fading: 

Variation of signal strength over distances of the order of cell sizes. Received 
power decreases with distance r like: 

^ (free space) 

— (reflection from ground plane). 

Decay can be even faster due to shadowing and scattering effects. 

Small-scale fading: 

Variation of signal strength over distances of the order of the carrier wavelength, 
due to constructive and destructive interference of multipaths. Key parameters: 

Doppler spread D s < > coherence time T c rsj 1 /D s 

Doppler spread is proportional to the velocity of the mobile and to the angular 
spread of the arriving paths. 

delay spread Td < — > coherence bandwidth W c ~ 1/Td 

Delay spread is proportional to the difference between the lengths of the shortest 
and the longest paths. 

Input-output channel models: 

• Continuous-time passband ((2.14)): 

y(t) = a,i(t)x{t - Ti(t)). 
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• Continuous-time complex baseband ((2.26)): 

Vb(t) = y ^aj(t)e~ j2lTfcTi{t) x b (t - 

i 

• Discrete-time complex baseband with AWGN ((2.39)): 

y[m\ = E h(\m]x[m — f\ + w[m\. 

i 

The I th tap is the aggregation of the physical paths with delays in [l/W—l/{2W),l/W-\- 

1/(2 W)\. 

Statistical channel models: 

• is modeled as circular symmetric processes independent across the taps. 

• If for all taps, 

he[m\ ~ CJ\f( 0, erf) 

the model is called Rayleigh. 

• If for one tap, 

H m \ = \[ —Tf a t eJe + \l —TF C N(0, 
y K+i V K + i 

the model is called Rician with A'-factor k. 

• The tap gain autocorrelation function Re[n] := E[/i|[0]/i^[n]] models the depen- 
dency over time. 

• The delay spread is 1 jW times the range of taps f! s which contains most of the 
total gain ^[0]- The coherence time is 1 jW times the range of n's for which 
R([n] is significantly different from R.( [0] . 
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Exercises 

Exercise 2.1. Consider the electric field in (2.4). 

1. It has been derived under the assumption that the motion is in the direction of 
the line of sight from sending antenna to receive antenna. Find the electric field 
assuming that 0 is the angle between the line-of-sight and the direction of motion 
of the receiver. Assume that the range of time of interest is small enough so that 
changes in (0, 0) can be ignored. 

2. Explain why, and under what conditions, it is a reasonable approximation to 
ignore the change in (0, ijf) over small intervals of time. 

Exercise 2.2. Equation (2.13) was derived under the assumption that r(t) ~ d. 
Derive an expression for the received waveform for general r{t). Break the first term in 
(2.11) into two terms, one with the same numerator but the denominator 2 d — r 0 — vt 
and the other with the remainder. Interpret your result. 

Exercise 2.3. In the two-path example in Sections 2.1.3 and 2.1.4, the wall is on 
the right side of the receiver so that the reflected wave and the direct wave travel 
in opposite directions. Suppose now that the reflecting wall is on the left side of 
transmitter. Redo the analysis. What is the nature of the multipath fading, both over 
time and over frequency? Explain any similarity or difference with the case considered 
in Sections 2.1.3 and 2.1.4. 

Exercise 2.4. A mobile receiver is moving at a speed v and is receiving signals arriving 
along two reflected paths which make angles 0\ and 62 with the direction of motion. 
The transmitted signal is a sinusoid at frequency /. 

1. Is the above information enough for estimating i) the coherence time T c ; ii) the 
coherence bandwidth W c ? If so express them in terms of the given parameters. 
If not, specify what additional information would be needed. 

2. Consider an environment in which there are reflectors and scatterers in all direc- 
tions from the receiver and an environment in which they are clustered within 
a small angular range. Using part (1), explain how the channel would differ in 
these two environments. 

Exercise 2.5. Consider the propagation model in Section 2.1.5 where there is a re- 
flected path from the ground plane. 

1. Let rq be the length of the direct path in Figure 2.6. Let r 2 be the length of the 
reflected path (summing the path length from the transmitter to the ground plane 
and the path length from the ground plane to the receiver). Show that r 2 — rq 
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is asymptotically equal to b/r and find the value of the constant b. Hint Recall 
that for x small, y/1 + x ~ 1 + x/2 in the sense that (a/1 + x — l) /x — ■» 1/2 as 
x — > 0. 



2. Assume that the received waveform at the receive antenna is given by 

a cos 27r [ft — fr 1 /c] a cos 27r [ft — fr 2 / c] 



E r (f,t)= 



T 1 



T 2 



(2.63) 



Approximate the denominator r 2 by rq in (2.63) and show that E r ps /3/r 2 for 
r _1 much smaller than c/f. Find the value of (3. 

3. Explain why this asymptotic expression remains valid without first approximat- 
ing the denominator r 2 in (2.63) by rq. 



EXERCISE 2.6. Consider the following simple physical model in just a single dimension. 
The source is at the origin and transmits a isotropic wave of angular frequency u>. The 
physical environment is filled with uniformly randomly located obstacles. We will 
model the inter-obstacle distance as an exponential random variable, i.e., it has the 
density 9 : 

rje~ vr , r > 0. (2.64) 

Here 1/rj is the mean distance between obstacles and captures the density of the obsta- 
cles. Viewing the source as a stream of photons, suppose each obstacle independently 
(from one photon to the other and independent of the behavior of the other obstacles) 
either absorbs the photon with probability 7 or scatters it either to the left or to the 
right (both with equal probability (1 — 7) /2). 

Now consider the path of a photon transmitted either to the left or to the right with 
equal probability from some fixed point on the line. The probability density function 
of the distance (denoted by r) to the Erst obstacle (the distance can be on either side 
of the starting point, so r takes values on the entire line) is equal to 

q(r) := r J^l j r G 7 7. (2.65) 

So the probability density function of the distance at which the photon is absorbed 
upon hitting the Erst obstacle is equal to: 



/i(r) := 7 g(r), 



r e 7 Z. 



( 2 . 66 ) 



1. Show that the probability density function of the distance from the origin at 
which the second obstacle is met is 



f 2 (r) ■= (1 - 7) q(x)fi(r - x) dx, 



r e 7 Z. 



(2.67) 



9 This random arrangement of points on a line is called a Poisson point process. 
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2. Denote by / fc (r), the probability density function of the distance from the origin 
at which the photon is absorbed by exactly the k th obstacle it hits and show the 
recursive relation: 



fk+ 1 (r)= (I -7 )q(x)f k (r -x) dx, re 11 . ( 2 . 68 ) 



3. Conclude from the previous step that the probability density function of the 
distance from the source at which the photon is absorbed (by some obstacle), 
denoted by f(r), satisfies the recursive relation: 

/ OO 

q(x)f(r — x ) dx, r e 1Z. (2.69) 

-OO 

Hint Observe that /(r) = Y1T=i /fc(' r )- 

4. Show that 

S(r) = ^VV 7 M_ (2.70) 

is a solution to the recursive relation in (2.69). Hint: Observe that the convolu- 
tion operation between the probability densities q(-) and /(•) in (2.69) is more 
easily represented under Fourier transform. 

5. Now consider the photons that are absorbed at a distance of more than r from 
the source. This is the radiated power density at a distance r and is found by 
integrating f(x) over the range (r, oo) if r > 0 and (— oo,r) if r < 0. Calculate 
the radiated power density to be 



P - ly/n\r\ 

~ 2 ~’ (271) 

and conclude that the power decreases exponentially with distance r. Also ob- 
serve that with very low absorption (7 — » 0 ) or very few obstacles (7 — > 0 ), the 
power density converges to 0.5; this is expected since the power splits equally on 
either side of the line. 

EXERCISE 2.7. In Exercise 2.6, we considered a single dimensional physical model of a 
scattering and absorption environment and concluded that power decays exponentially 
with distance. A reading exercise is to study [26] which considers a natural extension of 
this simple model to two and three dimensional spaces. Further, it extends the analysis 
to two and three dimensional physical models. While the analysis is more complicated, 
we arrive at the same conclusion: the radiated power decays exponentially in distance. 



Exercise 2.8. 
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Exercise 2.9. Assume that a communication channel first filters the transmitted 
passband signal before adding WGN. Suppose the channel is known and the channel 
filter has an impulse response h(t). Suppose that a QAM scheme with symbol duration 
T is developed without knowledge of the channel filtering. A baseband filter 6(t) is 
developed satisfying the Nyquist property that {9(t — kT)}k is an orthonormal set. 
The matched filter 9(—t) is used at the receiver before sampling and detection. 

If one is aware of the channel filter h(t), one may want to redesign either the 
baseband filter at the transmitter or the baseband filter at the receiver so that there 
is no intersymbol interference between receiver samples and so that the noise on the 
samples is i.i.d. 

1. Which filter should one redesign? 

2. Give an expression for the impulse response of the redesigned filter (assume a 
carrier frequency f c ). 

3. Draw a figure of the various filters at passband to show why your solution is 
correct. (We suggest you do this before answering the first two parts.) 

Exercise 2.10. Consider the two-path example in Section 2.1.4 with d — 2 km and 
the receiver at 1.5 km from the transmitter moving at velocity 60 km/h away from the 
transmitter. The carrier frequency is 900 MHz. 

1. Plot in MATLAB the magnitudes of the taps of the discrete-time baseband chan- 
nel at a fixed time t. Give a few plots for several bandwidths W so as to exhibit 
both flat and frequency-selective fading. 

2. Plot the time variation of the phase and magnitude of a typical tap of the discrete- 
time baseband channel for a bandwidth where the channel is (approximately) flat 
and for a bandwidth where the channel is frequency selective. How do the time- 
variations depend on the bandwidth? Explain. 

EXERCISE 2.11. For each tap of the discrete time channel response, the Doppler spread 
is the range of Doppler shifts of the paths contributing to that tap. Give an example 
of an environment (i.e., location of reflectors/scatterers with respect to the location of 
the transmitter and the receiver) in which the Doppler spread is the same for different 
taps and an environment in which they are different. 

Exercise 2.12. Verify (2.40) and (2.41). 

EXERCISE 2.13. In this problem we consider generating passband orthogonal wave- 
forms from baseband ones. 
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1. Show that if the waveforms {9(t — nT)} n form an orthogonal set, then the wave- 
forms {'tp n ,i,' l Pn, 2 }n also form an orthogonal set, provided that 9{t) is bandlimited 
to [-fcfd- Here, 

ip n , i(t) = 9(t — nT) cos 2nf c t 
■0 n>2 (t) = 9{t — nT) sm2nf c t. 

How should we normalize the energy of 9(t) to make the ^(f)’ s orthcniormaU 

2. For a given f c , hnd an example where the result in part 1 is false when the 
condition that 9{t) is bandlimited to [— f c , f c } is violated. 

EXERCISE 2.14. Verify (2.25). Does this equation contain any more information about 
the communication system in Figure 2.9 beyond what is in (2.24)? Explain. 

EXERCISE 2.15. Compute the probability density function of the magnitude \X\ of a 
complex circular symmetric Gaussian random variable X with variance a 2 . 



Exercise 2.16. In the text we have discussed the various reasons why the channel tap 
gains he[m] vary in time (as a function of m) and how the various dynamics operate 
at different time-scales. The analysis is based on the assumption that communication 
takes place in a bandwidth W around a carrier frequency f c , with f c 3> W. This 
assumption is not valid for ultra-wideband communication systems, where the trans- 
mission bandwidth is from 3.1 GHz to 10.6 GHz, as regulated by the FCC. Redo the 
analysis for this system. What is the main mechanism that causes the tap gains to 
vary at the fastest time-scale, and what is this fastest time-scale determined by? 



Exercise 2.17. Give a convincing argument why it is reasonable to assume that the 
complex random vector 

/ h e [m] \ 
hf{m + 1] 



h := 



y h(\m + n\ J 

is circular symmetric. Here, h(\rn\ is the complex 0 th tap at time m of the baseband 
model for the multipath fading channel. 



EXERCISE 2.18. In this question, we will analyze in detail Clarke’s one-ring model 
discussed at the end of the chapter. Recall that the scatterers are assumed to be 
located in a ring around the receiver moving at speed v. The path coming at angle 
9 with respect to the direction of motion of the mobile has a delay of Tg(t) and a 
time- invariant gain a (not dependent on the angle), and the input-output relationship 
is given by: 

p2ty 

y(t ) = / ax(t — Tg(t))d9. 

Jo 



(2.72) 
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1. Give an expression for the impulse response h(r,t ) for this channel, and give an 
expression for Tg{t) in terms of Tg(0). (You can assume that the distance the 
mobile travelled in [0,f] is small compared to the radius of the ring.) 

2. Suppose communication takes place at carrier frequency f c and over a narrow- 
band of bandwidth W such that the delay spread of the channel satisfies 
Td -C 1 jW . Argue that the discrete-time baseband model can be approximately 
represented by a single tap: 

y[m] = h[m]x[m] +w[m] (2.73) 

and give an approximate expression for that tap in terms of the ae s and s. 
Hint Your answer should contain no sine functions. 

3. Argue that it is reasonable to assume that the phase of the path from an angle 
6 at time 0, 

2vr f c Tg (0) (mod27r ) 

is uniformly distributed in [0, 27r] and that it is i.i.d. across 9. 

4. Show that, based on the assumptions in part (3), {h[m\} is a stationary Gaussian 
process. Verify that the autocorrelation function A 0 [n] is given by (2.59). 

5. Verify that the Doppler spectrum S(f ) is given by (2.61). Hint. It is easier to 
show that the inverse Fourier transform of (2.61) is (2.59). 

6. Verify that S(f)df is indeed the received power from the paths that have Doppler 
shifts in [/, f + df]. Is this surprising? 



Exercise 2.19. Consider a one-ring model where the scatterers are located in a contin- 
uum and uniformly on a circle of radius 1 km around the receiver and the transmitter 
is 2 km away. The transmit power is P. The power attenuation along a path from the 
transmitter to a scatterer to the receiver is 



G ■ 




1 

^2 ’ 



(2.74) 



where G is a constant and r and s are the distance from the transmitter to the scatterer 
and the distance from the scatterer to the receiver respectively. Communication takes 
place at a carrier frequency f c = 1.9 GHz and the bandwidth is W Hz. You can assume 
that, at any time, the phases of each arriving path in the baseband representation of 
the channel are independent and uniformly distributed between 0 and 27 r. 



1. What are the key differences and the similarities between this model and the 
Clarke’s model in the text? 
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2. Find approximate conditions on the bandwidth W for which one gets a flat fading 
channel. 

3. Suppose the bandwidth is such that the channel is frequency selective. Find 
approximately the amount of power in tap i of the discrete-time baseband im- 
pulse response of the channel (i.e., compute the power-delay profile.). Make any 
simplifying assumptions but state them. (You can leave your answers in terms 
of integrals if you cannot evaluate them.) 

4. Compute and sketch the power-delay profile as the bandwidth becomes very 
large. 

5. Suppose now the receiver is moving at speed v towards the (fixed) transmitter. 
What is the Doppler spread of tap C! Argue heuristically from physical con- 
siderations what the Doppler spectrum (i.e., power spectral density) of tap f 
is. 

6. We have made the assumptions that the scatterers are all on a circle of radius 1 
km around the receiver and the paths arrive at uniform phases at the receiver. 
Mathematically, are the two assumptions consistent? If not, do you think it 
matters, in terms of the validity of your answers to the earlier parts of this 
question? 

Exercise 2.20. In this exercise, we study the effect of correlation between the mobile 
and the base station antennas. Often in modeling multiple input multiple output 
(MIMO) fading channels the fading coefficients between different transmit and receive 
antennas are assumed to be independent random variables. This problem explores 
whether this is a reasonable assumption based on Clarke’s one ring scattering model 
and the antenna separation. 

1. (Antenna separation at the mobile) Assume a mobile with velocity v moving 
away from the base station, with uniform scattering from the ring around it. 

(a) Compute the Doppler spread D s for a carrier frequency f c , and the corre- 
sponding coherence time T c . 

(b) Assuming that fading states separated by T c are approximately uncorre- 
lated, at what distance should we place a second antenna at the mobile 
to get an independently faded signal? Hint : How much distance does the 
mobile travel in T c ? 

2. (Antenna separation at the base station) Assume that the scattering ring has 
radius R and that the distance between the base station and the mobile is d. 
Further assume for the time being that the base station is moving away from the 
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mobile with velocity v' . Repeat the previous part to find the minimum antenna 
spacing at the base station for uncorrelated fading. Flint. Is the scattering still 
uniform around the base station ? 

3. Typically, the scatterers are locally around the mobile and far away from the 
base station. What is the implication of your result in part (2) for this scenario? 




Chapter 3 

Point-to-Point Communication: 
Detection, Diversity and Channel 
Uncertainty 



In this chapter we look at various basic issues that arise in communication over fading 
channels. We start by analyzing uncoded transmission in a narrowband fading channel. 
We study both coherent and noncoherent detection. In both cases the error probability 
is much higher than in a non-faded AWGN channel. The reason is that there is 
a significant probability that the channel is in a deep fade. This motivates us to 
investigate various diversity techniques which improve the performance. The diversity 
techniques discussed operate over time, frequency or space, but the basic idea is the 
same. By sending signals that carry the same information through different paths, 
multiple independently faded replicas of data symbols are obtained at the receiver 
end and more reliable detection can be achieved. The simplest diversity schemes use 
repetition coding. More sophisticated schemes exploit channel diversity and, at the 
same time, efficiently use the degrees of freedom in the channel. Compared to repetition 
coding, they provide coding gains in addition to diversity gains. In space diversity, we 
look at both transmit and receive diversity schemes. In frequency diversity, we look at 
three approaches: 

• single-carrier with inter-symbol interference equalization, 

• direct sequence spread spectrum, 

• orthogonal frequency division multiplexing. 

Finally, we study the impact of channel uncertainty on the performance of diversity 
combining schemes. We will see that in some cases, having too many diversity paths 
can have an adverse effect clue to channel uncertainty. 

To familiarize ourselves with the basic issues, the emphasis of this chapter is on 
concrete techniques for communication over fading channels In Chapter 5 we take a 
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more fundamental and systematic look and use information theory to derive the best 
performance one can achieve. At that fundamental level, we will see many of the issues 
discussed here recur. 

The derivations in this chapter make repeated use of a few key results in vector 
detection under Gaussian noise. We develop and summarize the basic results in Ap- 
pendix A, emphasizing the underlying geometry. The reader is encouraged to take 
a look at the appendix before proceeding with this chapter and to refer back to it 
often. In particular, a thorough understanding of the canonical detection problem in 
Summary 2 will be very useful. 



3.1 Detection in a Rayleigh Fading Channel 

3.1.1 Noncoherent Detection 



We start with a very simple detection problem in a fading channel. For simplicity, 
let us assume a flat fading model where the channel can be represented by a single 
discrete-time complex filter tap ho\m], which we abbreviate as h[m]: 

y[m] = h[m]x[m] + w[m], (3.1) 



where w[m\ ~ CAf(0,N o ). We suppose Rayleigh fading; i.e. , h[m\ ~ CJ\T{ 0,1), where 
we normalize the variance to be 1. For the time being, however, we do not specify the 
dependence between the fading coefficients h [m] ’s at different times m nor do we make 
any assumption on the prior knowledge the receiver might have of h[m\ s. (This latter 
assumption is sometimes called noncoherent communication.) 

First consider uncoded binary antipodal signaling (or binary phase-shift-keying, 
BPSK) with amplitude a, i.e., x[m\ = ±a, and the symbols x[m] are independent over 
time. This signaling scheme fails completely, even in the absence of noise, since the 
phase of the received signal y[m] is uniformly distributed between 0 and 2tt regardless 
of whether x[m] = a or x[m\ = —a is transmitted. Further, the received amplitude 
is independent of the transmitted symbol. Binary antipodal signaling is binary phase 
modulation and it is easy to see that phase modulation in general is similarly flawed. 
Thus, signal structures are required in which either different signals have different 
magnitudes, or coding between symbols is used. Next we look at orthogonal signaling, 
a special type of coding between symbols. 

Consider the following simple orthogonal modulation scheme: a form of binary 
pulse-position modulation. For a pair of time samples, either transmit 



xa := 




(3.2) 



x B 



a 



(3.3) 
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We would like to perform detection based on 




(3.4) 



This is a simple hypothesis testing problem, and it is straightforward to derive the 
maximum likelihood (ML) rule: 



XA 

A(y) ^ 0, 



(3.5) 



where A(y) is the log- likelihood ratio 



XB 



A(y) := In 



fjyM ] 
/( yl x B) J ’ 



(3.6) 



It can be seen that, if x A is transmitted, t/[0] ~ CAf( 0, a 2 + N 0 ) and y[ 1] ~ CAf(0 , N 0 ) 
and 2/[0], 2/[l] are independent. Similarly, ifx B is transmitted, y[0] ~ CAf(0,N o ) and 
y[ i] rs_/ CJ\f(0 , a 2 + N 0 ). Further, y[0] and t/[l] are independent. Hence the log-likelihood 
ratio can be computed to be 

{m\ 2 -m\ 2 }a 2 

(a 2 + N 0 )N 0 

The optimal rule is simply to decide x A is transmitted if |?/[0]| 2 > |t/[l]| 2 and decide x B 
otherwise. Note that the rule does not make use of the phases of the received signal, 
since the random unknown phases of the channel gains /i[0],/i[l] render them useless 
for detection. Geometrically, we can interpret the detector as projecting the received 
vector y onto each of the two possible transmit vectors x^ and x^ and comparing the 
energies of the projections (Figure 3.1). Thus, this detector is also called an energy or 
a square-law detector. It is somewhat surprising that the optimal detector does not 
depend on how h[ 0] and /r[l] are correlated. 

We can analyze the error probability of this detector. By symmetry, we can assume 
that xa is transmitted. Under this hypothesis, y[0] and y[l] are independent circular 
symmetric complex Gaussian random variables with variances a 2 + N 0 and N 0 respec- 
tively. (See Section A. 1.3 in the appendices for a discussion on circular symmetric 
Gaussian random variables and vectors.) As shown there, |y[0]| 2 , |t/[l]| 2 are exponen- 
tially distributed with mean a 2 + and IVo respectively. 1 The probability of error 
can now be computed by direct integration: 




(3.7) 



Pe = P{||/[1]| 2 > ||/[0]| 2 |X A } 



2 + 



N 0 



(3.8) 



1 R.ecall that a random variable U is exponentially distributed with mean y if its pdf is fij{u) = 

Lp-u/l u 

w 
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Figure 3.1: The noncoherent detector projects the received vector y onto each of 
the two orthogonal transmitted vectors and xb and compares the lengths of the 
projections. 
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We make the general definition 



SNR : = 



average received signal energy per (complex) symbol time 
noise energy per (complex) symbol time 



(3.9) 



which we use consistently throughout the book for any modulation scheme. The noise 
energy 2 per complex symbol time is Nq. For the orthogonal modulation scheme here, 
the average received energy per symbol time is a 2 / 2 and so 



SNR : = 



2iV n 



(3.10) 



Substituting into (3.8), we can express the error probability of the orthogonal scheme 
in terms of SNR: 



1 

Pe= 2(1 + SNR) 



(3.11) 



This is a very discouraging result. To get an error probability p e = 10 -3 one would 
require SNR ^ 500 (27 dB). Stupendous amounts of power would be required for more 
reliable communication. 



3.1.2 Coherent Detection 

Why is the performance of the noncoherent maximum likelihood (ML) receiver on a 
fading channel so bad? It is instructive to compare its performance with detection in 
an AWGN channel without fading: 

y[m\ = x[m\ + w[m\. (3.12) 

For antipodal signaling (BPSK), x[m\ = ±a, a sufficient statistic is 5 R{y[m]} and the 
error probability is 

p ' = Q (yi7i) =Q (^)’ (313) 

where SNR = a 2 /Nq is the received signal to noise ratio per symbol time, and Q(-) 
is the complementary cumulative distribution function of an N( 0, 1) random variable. 
This function decays exponentially with x 2 ; more specifically, 

Q(x) < e -a;2 / 2 , x > 0 (3-14) 

2 The orthogonal modulation scheme considered here uses only real symbols and hence transmits 

only on the I-channel. Hence it may seem more natural to define the SNR. in terms of noise energy 

per real symbol, i.e., No/2. However, later we will consider modulation schemes that use complex 
symbols and hence transmit on both the I and Q channels. In order to be consistent throughout, we 
choose to define SNR. this way. 
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and 

Q(x) > -F- (l - W x > 1. (3.15) 

V27 tx V x J 

Thus, the detection error probability decays exponentially in SNR in the AWGN channel 
while it decays only inversely with the SNR in the fading channel. To get an error 
probability of ICR 3 , an SNR of about 7 dB is needed in an AWGN channel (as compared 
to 27 dB in the noncoherent channel). Note that 2y / SNR is the separation between the 
two constellation points as a multiple of the standard deviation of the Gaussian noise; 
the above observation says that when this separation is much larger than 1, the error 
probability is very small. 

Compared to detection in the AWGN channel, the detection problem considered in 
the previous section has two differences: the channel gains h[m \' s are random, and the 
receiver is assumed not to know them. Suppose now that the channel gains are tracked 
at the receiver so that they are known at the receiver (but still random). In practice, 
this is done either by sending a known sequence (called a pilot or training sequence) or 
in a decision directed manner, estimating the channel using symbols detected earlier. 
The accuracy of the tracking depends, of course, on how fast the channel varies. For 
example, in a narrowband 30kHz channel (such as that used in the North American 
TDMA cellular standard IS-136) with a Doppler spread of 100Hz, the coherence time 
T c is roughly 80 symbols and in this case there is plenty of time to estimate the channel 
with minimal overhead expended in the pilot. 3 For our current purpose, let us suppose 
that the channel estimates are perfect. 

Knowing the channel gains, coherent detection of BPSK can now be performed on 
a symbol by symbol basis. We can focus on one symbol time and drop the time index 

y = hx + w (3.16) 

Detection of x from y can be done exactly as in the AWGN case, except that the 
decision is now based on the sign of the real sufficient statistic 

r := lR{(h/\h\)* y} = \h\x + z, (3-17) 

where z ~ iV(0, N 0 /2). If the transmitted symbol is x = ±a, then for a given value of 
h, the error probability of detecting x is 




where SNR = a 2 /No is the average received signal-to-noise ratio per symbol time. 
(Recall that we normalized the channel gain such that E[|/i| 2 ] = 1.) We average over 

3 The channel estimation problem for a broadband channel with many taps in the impulse response 
is more difficult; we will get to this in Section 3.5. 
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the random gain h to find the overall error probability. For Rayleigh fading when 
h ~ CM{ 0, 1), direct integration yields 



Pe = 



E 



Q ("a/2|/i| 2 SNR 



CCS) 



(3.19) 



(See Exercise 3.1.) Figure 3.2 compares the error probabilities of coherent BPSK and 
noncoherent orthogonal signaling over the Rayleigh fading channel, as well as BPSK 
over the AWGN channel. We see that while the error probability for BPSK over the 
AWGN channel decays very fast with the SNR, the error probabilities for the Rayleigh 
fading channel are much worse, whether the detection is coherent or noncoherent. At 
high SNR, Taylor series expansion yields 



SNR 
1 + SNR 



2SNR 



+ 0 




(3.20) 



Substituting into (3.19), we get the approximation 



1 

4SNR’ 



(3.21) 



which decays inversely proportional to the SNR, just as in the noncoherent orthogonal 
signaling scheme (c.f. (3.11)). There is only a 3 dB difference in the required SNR 
between the coherent and noncoherent schemes; in contrast, at an error probability of 
10 -3 , there is a 17 dB difference between the performance on the AWGN channel and 
coherent detection on the Rayleigh fading channel. 4 

We see that the main reason why detection in fading channel has poor performance 
is not because of the lack of knowledge of the channel at the receiver. It is due to 
the fact that the channel gain is random and there is a significant probability that the 
channel is in a “deep fade”. At high SNR, we can in fact be more precise about what 
a “deep fade” means by inspecting (3.18). The quantity | A | 2 S N R is the instantaneous 
received SNR. Under typical channel conditions, i.e., | A | 2 S N R 3> 1, the conditional 
error probability is very small, since the tail of the Q-function decays very rapidly. In 
this regime, the separation between the constellation points is much larger than the 
standard deviation of the Gaussian noise. On the other hand, when | A | 2 S N R is of the 
order of 1 or less, the separation is of the same order as the standard deviation of the 
noise and the error probability becomes significant. The probability of this event is 



r 1/SNR 

P{|/r| 2 SNR < 1} = / e~ x dx (3.22) 

Jo 

4 Communication engineers often compare schemes based on the difference in the required SNR to 
attain the same error probability. This corresponds to the horizontal gap between the error probability 
versus SNR curves of the two schemes. 
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Figure 3.2: Performance of coherent BPSK vs noncoherent orthogonal signaling over 
Rayleigh fading channel vs BPSK over AWGN channel. 




(3.23) 



This probability has the same order of magnitude as the error probability itself (c.f. 
(3.21)). Thus, we can define a ’’deep fade” via an order-of-magnitude approximation 



Deep fade event: |/?,| 2 < 
P {deep fade} ~ gW. 



We conclude that high-SNR error events most often occur because the channel is in 
deep fade and not as a result of the additive noise being large. In contrast, in the 
AWGN channel the only possible error mechanism is for the additive noise to be large. 
Thus, the error probability performance over the AWGN channel is much better. 

We have used the explicit error probability expression (3.19) to help identify the 
typical error event at high SNR. We can in fact turn the table around and use it as 
a basis for an approximate analysis of the high-SNR performance (Exercises 3.2 and 
3.3). Even though the error probability p e can be directly computed in this case, 
the approximate analysis provides much insight as to how typical errors occur. Under- 
standing typical error events in a communication system often suggests how to improve 
it. Moreover, the approximate analysis gives some hints as to how robust the conclu- 
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sion is to the Rayleigh fading model. In fact, the only aspect of the Rayleigh fading 
model that is important to the conclusion is the fact that P{|h| 2 < e} is proportional 
to e for e small. This holds whenever the pdf of |/?,| 2 is positive and continuous at 0. 

3.1.3 From BPSK to QPSK: Exploiting the Degrees of Free- 
dom 

In Section 3.1.2, we have considered BPSK modulation, x[m] = ±a. This uses only the 
real dimension (the I channel), while in practice both the I and Q channels are used 
simultaneously in coherent communication, increasing spectral efficiency. Indeed, an 
extra bit can be transmitted by instead using QPSK (quadrature phase shift keying) 
modulation, i.e., the constellation is 

{a(l + j),a(l - j), a(-l + j), a(-l - ])} ; (3.24) 

in effect, a BPSK symbol is transmitted on each of the I and Q channels simultaneously. 
Since the noise is independent across the I and Q channels, the bits can be detected 
separately and the bit error probability on the AWGN channel (c.f. (3.12)) is 



Q 




(3.25) 



the same as BPSK (c.f. (3.13)). For BPSK, the SNR (as defined in (3.9)) is given by 



SNR=^, (3.26) 

while for QPSK, 

2 a 2 

SNR = (3.27) 

JV 0 



is twice that of BPSK since both the I and Q channels are used. Equivalently, for 
a given SNR, the bit error probability of BPSK is Q(V2SNR) (c.f. (3.13)) and that 
of QPSK is Q(V / SNR). The error probability of QPSK under Rayleigh fading can be 
similarly obtained by replacing SNR by SNR/2 in the corresponding expression (3.19) 
for BPSK to yield 



1 / / SNR \ _ 1 

2 V 1 V 2 + SNR J ~ 2SNR' 



(3.28) 



at high SNR. For expositional simplicity, we will consider BPSK modulation in much 
of the discussions in this chapter, but the results can be directly mapped to QPSK 
modulation. 
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Figure 3.3: QPSK versus 4-PAM: for the same minimum separation between constel- 
lation points, the 4-PAM constellation requires higher transmit power. 



One important point worth noting is that it is much more energy-efficient to use 
both the I and Q channels rather than just one of them. For example, if we had to send 
the two bits carried by the QPSK symbol on the I channel alone, then we would have 
to transmit a 4-PAM symbol. The constellation is {-36,-6,6,36} and the average 
error probability on the AWGN channel is 




To achieve approximately the same error probability as QPSK, the argument inside 
the Q-function should be the same as that in (3.25) and hence 6 should be the same as 
a, i.e., the same minimum separation between points in the two constellations (Figure 
3.3). But QPSK requires a transmit energy of 2 a 2 per symbol, while 4-PAM requires a 
transmit energy of 56 2 per symbol. Hence, for the same error probability, approximately 
2.5 times more transmit energy is needed: a 4 dB worse performance. Exercise 3.4 
shows that this loss is even more significant for larger constellations. The loss is due 
to the fact that it is more energy-efficient to pack, for a desired minimum distance 
separation, a given number of constellation points in a higher-dimensional space than 
in a lower-dimensional space. We have thus arrived at a general design principle (c.f. 
Discussion 1): 



A good communication scheme exploits all the available degrees of freedom in 
the channel. 
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Scheme 


Bit Error Prob. (High SNR) 


Data Rate (bits/s/Hz) 


Coherent BPSK 


i 

4SNR 


1 


Coherent QPSK 


1 

2SNR 


2 


Coherent 4-PAM 


5 

4SNR 


2 


Coherent 16-QAM 


5 

2SNR 


4 


Noncoherent orth. mod. 


2SNR 


1/2 


Differential BPSK 


2SNR 


1 


Differential QPSK 


SNR 


2 



Table 3.1: Performance of coherent and noncoherent schemes under Rayleigh fading. 
The data rates are in bits/s/Hz, which is the same as bits per complex symbol time. 
The performance of differential QPSK is derived in Exercise 3.5. It is also 3-dB worse 
than coherent QPSK. 



This important principle will recur throughout the book, and in fact will be shown 
to be of a fundamental nature as we talk about channel capacity in Chapter 5. Here, 
the choice is between using just the I channel and using both the I and Q channels, 
but the same principle applies to many other situations. As another example, the 
noncoherent orthogonal signaling scheme discussed in Section 3.1.1 conveys one bit 
of information and uses one real dimension per two symbol times (Figure 3.4). This 
scheme does not assume any relationship between consecutive channel gains, but if we 
assume that they do not change much from symbol to symbol, an alternative scheme 
is differential BPSK, which conveys information in the relative phases of consecutive 
transmitted symbols. That is, if the BPSK information symbol is u\m] at time m 
(■ u[m\ = ±1), the transmitted symbol at time m is given by 

x[m] = u[m]x[m — 1]. (3.30) 

Exercise 3.5 shows that differential BPSK can be demodulated noncoherently at the 
expense of a 3dB loss in performance compared to coherent BPSK (at high SNR). But 
since noncoherent orthogonal modulation also has a 3dB worse performance compared 
to coherent BPSK, this implies that differential BPSK and noncoherent orthogonal 
modulation have the same error probability performance. On the other hand, differ- 
ential BPSK conveys one bit of information and uses one real dimension per smgle 
symbol time, and therefore has twice the spectrally efficiency of orthogonal modula- 
tion. Better performance is achieved because differential BPSK uses more efficiently 
the available degrees of freedom. 
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Figure 3.4: Geometry of orthogonal modulation. Signalling is performed over one real 
dimension, but two (complex) symbol times are used. 
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3.1.4 Diversity 

The performance of the various schemes considered so far for fading channels is sum- 
marized in Table 3.1. Some schemes are spectrally more efficient than others, but 
from a practical point of view, they are all bad: the error probabilities all decay very 
slowly, like 1/SNR. From Section 3.1.2, it can be seen that the root cause of this poor 
performance is that reliable communication depends on the strength of a single signal 
path. There is a significant probability that this path will be in a deep fade. When 
the path is in a deep fade, any communication scheme will likely suffer from errors. A 
natural solution to improve the performance is to ensure that the information symbols 
pass through multiple signal paths, each of which fades independently, making sure 
that reliable communication is possible as long as one of the paths is strong. This 
technique is called diversity , and it can dramatically improve the performance over 
fading channels. 

There are many ways to obtain diversity. Diversity over time can be obtained via 
coding and interleaving : information is coded and the coded symbols are dispersed over 
time in different coherence periods so that different parts of the codewords experience 
independent fades. Analogously, one can also exploit diversity over frequency if the 
channel is frequency-selective. In a channel with multiple transmit or receive antennas 
spaced sufficiently far enough, diversity can be obtained over space as well. In a 
cellular network, macrodiversity can be exploited by the fact that the signal from 
a mobile can be received at two base-stations. Since diversity is such an important 
resource, a wireless system typically uses several types of diversity. 

In the next few sections, we will discuss diversity techniques in time, frequency and 
space. In each case, we start with a simple scheme based on repetition coding : the 
same information symbol is transmitted over several signal paths. While repetition 
coding achieves the maximal diversity gain, it is usually quite wasteful of the degrees 
of freedom of the channel. More sophisticated schemes can increase the data rate and 
achieve a coding gain along with the diversity gain. 

To keep the discussion simple we begin by focusing on the coherent scenario: the 
receiver has perfect knowledge of the channel gains and can coherently combine the 
received signals in the diversity paths. As discussed in the previous section, this knowl- 
edge is learnt via training (pilot) symbols and the accuracy depends on the coherence 
time of the channel and the received power of the transmitted signal. We discuss 
the impact of channel measurement error and noncoherent diversity combining in Sec- 
tion 3.5. 



3.2 Time Diversity 

Time diversity is achieved by averaging the fading of the channel over time. Typically, 
the channel coherence time is of the order of 10’s to 100’s of symbols, and therefore 
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I h 



L = A 




Figure 3.5: The codewords are transmitted over consecutive symbols (top) and inter- 
leaved (bottom). A deep fade will wipe out the entire codeword in the former case 
but only one coded symbol from each codeword in the latter. In the latter case, each 
codeword can still be recovered from the other three unfaded symbols. 



the channel is highly correlated across consecutive symbols. To ensure that the coded 
symbols are transmitted through independent or nearly independent fading gains, in- 
terleaving of codewords is required (Figure 3.5). For simplicity, let us consider a flat 
fading channel. We transmit a codeword x = [aq, . . . , aq,]* of length L symbols and the 
received signal is given by 



Hi = htrxi + we, £ = 1,...,L. (3.31) 

Assuming ideal interleaving so that consecutive symbols aq are transmitted sufficiently 
far apart in time, we can assume that the he' s are independent. The parameter L is 
commonly called the number of diversity branches. The additive noises wq, ... , uq are 
i.i.d. CJ\f (Q,Nq) random variables. 
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3.2.1 Repetition Coding 

The simplest code is a repetition code, in which xi = ap for i = 1, . . . , L. In vector 
form, the overall channel becomes 



y = hap + w, (3.32) 

where y = [yi , . . . , y L ]\ h = [h u ..., h L Y and w = [up, . . . , wl]*. 

Consider now coherent detection of op, i.e., the channel gains are known to the 
receiver. This is the canonical vector Gaussian detection problem in Summary 2 of 
Appendix A. The scalar 

j^y = l|h||ap + (3.33) 

is a sufficient statistic. Thus, we have an equivalent scalar detection problem with noise 
( h 7||h||)w rs_/ CM{ 0, Nq 1 l ). The receiver structure is a matched filter and is also called 
a maximal ratio combiner, it weighs the received signal in each branch in proportion 
to the signal strength and also aligns the phases of the signals in the summstion to 
maximize the output SNR. This receiver structure is also called coherent combining. 

Consider BPSK modulation, with ap = ±o. The error probability, conditioned on 
h, can be derived exactly as in (3.18): 

Q (V2||h|| 2 SNR) (3.34) 

where as before SNR = a 2 /N 0 is the average received signal-to-noise ratio per (complex) 
symbol time, and ||h|| 2 SNR is the received SNR for a given channel vector h. We 
average over ||h|| 2 to find the overall error probability. Under Rayleigh fading with 
each gain hi i.i.d. CA/"((), 1), 

L 

M 2 = £N 2 < 3 - 35 > 

1 = l 

is a sum of the squares of 2 L independent real Gaussian random variables, each term 
\h(\ 2 being the sum of the squares of the real and imaginary parts of hi. It is Chi-square 
distributed with 2 L degrees of freedom, and the density is given by 

f(x) = ^ ^ x L ~ l e~ x , x > 0. (3.36) 

The average error probability can be explicitly computed to be (c.f. Exercise 3.6: 

Q ^V / 2xSNR^ f(x) dx, 




P, 
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Figure 3.6: Error probability as a function of SNR for different numbers of diversity 
branches L. 



where 



h := 



SNR 
1 + SNR 



(3.38) 



The error probability as a function of the SNR for different numbers of diversity 
branches L is plotted in Figure 3.6. Increasing L dramatically decreases the error 
probability. 

At high SNR, we can see the role of L analytically: consider the leading term in 
the Taylor series expansion in 1/SNR to arrive at the approximations 



Furthermore, 



Hence, 




(3.39) 



(3.40) 



(3.41) 



at high SNR. In particular, the error probability decreases as the L th power of SNR, 
corresponding to a slope of — L in the error probability curve (in dB/dB scale). 

To understand this better, we examine the probability of the deep fade event, as in 
our analysis in Section 3.1.2. The typical error event at high SNR is when the overall 
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channel gain is small. This happens with probability 



P{||h|| 2 < 1/SNR} . (3.42) 

Figure 3.7 plots the distribution of ||h|| 2 for different values of L; clearly the tail of 
the distribution near zero becomes lighter for larger L. For small x, the probability 
density function of ||h|| 2 is approximately 

f( x ) ~ iy xL1 ( 3 - 43 ) 



and so 



P{||h|| 2 < 1/SNR} « 




(L-l)! 



X 



L-l 



dx 



1 1 
74 SNR l 



(3.44) 



This analysis is too crude to get the correct constant before the 1/SNR L term in (3.41), 
but does get the correct exponent L. Basically, an error occurs when Y^=i 1^1 2 is of the 
order or smaller than 1/SNR, and this happens when all the magnitudes of the gains 
|/^| 2, s are small, on the order of 1/SNR. Since the probability that each \h(\ 2 is less 
than 1/SNR is approximately 1/SNR and the gains are independent, the probability of 
the overall gain being small is of the order 1/SNR L . Typically, L is called the diversity 
gain of the system. 



3.2.2 Beyond Repetition Coding 



The repetition code is the simplest possible code. Although it achieves a diversity gain, 
it does not exploit the degrees of freedom available in the channel effectively because it 
simply repeats the same symbol over the L symbol times. By using more sophisticated 
codes, a coding gain can also be obtained beyond the diversity gain. There are many 
possible codes that one can use. We first focus on the example of a rotation code to 
explain some of the issues in code design for fading channels. 

Consider the case L — 2. A repetition code which repeats a BPSK symbol u = ±a 
twice obtains a diversity gain of 2 but would only transmit one bit of information over 
the two symbol times . Transmitting two independent BPSK symbols Ui,u 2 over the 
two times would use the available degrees of freedom more efficiently, but of course 
offers no diversity gain: an error would be made whenever one of the two channel gains 
hi, h -2 is in deep fade. To get both benefits, consider instead a scheme that transmits 
the vector 

I? r 

x = K 

U 2 



(3.45) 



over the two symbol times, where 



R := 



cos 9 — sin 9 
sin 9 cos 9 



(3.46) 
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Figure 3.7: The probability density function of || h || 2 for different values of L. The 
larger the L, the faster the probability density function drops off around 0. 

is a rotation matrix (for some 9 G (0, 2n)). This is a code with 4 codewords: 

xa = R ° ,x b = R ,x g = R ,x d = R ° ; (3.47) 

a J [ a J [ —a J [ —a 

they are shown in Figure 3.8(a). 5 The received signal is given by 

ye = h e xe + we, £ = 1, 2. (3.48) 

It is difficult to obtain an explicit expression for the exact error probability. So, we 
will proceed by looking at the union bound. Due to the symmetry of the code, without 
loss of generality we can assume xa is transmitted. The union bound says that 

Pe < p {xa -> X B } + P {xa -> X G } + P (xa -»• X D } (3.49) 

where P{xa — » ► x#} is the pairwise error probability of confusing xa with xg when 
xa is transmitted and when these are the only two hypotheses. Conditioned on the 

5 Here communication is over the (real) I channel since both x± and X 2 are real, but as in Section 
3.1.3, the spectral efficiency can be doubled by using both the I and the Q channels. Since the two 
channels are orthogonal, one can apply the same code separately to the symbols transmitted in the 2 
channels to get the same performance gain. 
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Here we have used the fact that the moment generating function for a unit mean 
exponential random variable X is E[e sX ] = 1/(1 — s) for s < 1. While it is possible 
to get an exact expression for the pairwise error probability, this upper bound is more 
explicit; moreover, it is asymptotically tight at high SNR (Exercise 3.7). 



We first observe that if d\ = 0 or = 0, then the diversity gain of the code is only 
1. If they are both nonzero, then at high SNR the above bound on the pairwise error 
probability becomes 




the squared product distance between and xg, when the average energy of the code 
is normalized to be 1 per symbol time (c.f. (3.52)). This determines the pairwise error 
probability between the two codewords. Similarly, we can define 8 VJ to be the squared 
product distance between x* and Xj, i,j = A,B,C,D. Combining (3.55) with (3.49) 
yields a bound on the overall error probability: 



p e < 16^- l? f? 1 SNR 2 , 
\<JAB Oac Oad ) 



< — —SNR' 2 . 

nnn j=B,c,D A j 



We see that as long as S tl > 0 for all i,j, we get a diversity gain of 2. The minimum 
squared product distance min j=b,c,d $Aj then determines the coding gain of the scheme 
beyond the diversity gain. This parameter depends on 9, and we can optimize over 9 
to maximize the coding gain. Here 



5ab = 8 ad = 4 sin 2 29, and 8ac — 16 cos 2 2d. (3.58) 



The angle 9* that maximizes the minimum squared product distance makes 8ab equal 
Sac j yielding 9* = (1/2) tan -1 2 and ruin S t j = 16/5. The bound in (3.57) now becomes 

p e < 15SNR' 2 . (3.59) 



To get more insight into why the product distance is important, we see from (3.51) 
that the typical way for x^ to be confused with x B is for the squared Euclidean distance 
\h \ | 2 |g?i | 2 + \ h 2 | 2 |d2 | 2 between the received codewords to be of the order of 1/SNR. This 
event holds roughly when both |/ri| 2 |di| 2 and | ^ 2 1 2 1 <^ 2 1 2 are of the order of 1/SNR, and 
this happens with probability approximately 
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Thus, it is important that both Id]/ 2 and \d 2 \~ are large to ensure diversity against 
fading in both components. 

It is interesting to see how this code compares to the repetition scheme. To keep 
the bit rate the same (2 bits over 2 real- valued symbols), the repetition scheme would 
be using 4- PAM modulation {—36, —6, 6, 36}. The codewords of the repetition scheme 
are shown in Figure 3.8(b). From (3.51), the pairwise error probability between two 
adjacent codewords (say, xq and x s ) is 



P{x A - x B } = E \Q ( y/SNR/2 • ( | h\ | 2 1 d \ | 2 + |6, 2 | 2 |d 2 | 2 ) 



(3.61) 



But now SNR = 56 2 /Nq is the average SNR per symbol time for the 4- PAM constel- 
lation, 6 and d\ = d 2 = are the normalized component differences between the 

adjacent codewords. The minimum squared product distance for the repetition code 
is therefore 16/25 and we can compare this to the minimum squared product distance 
of 16/5 for the previous rotation code. Since the error probability is proportional to 
SNR~ 2 in both cases, we conclude that the rotation code has an improved coding gain 
over the repetition code in terms of a saving in transmit power by a factor of \/5 
(3.5 dB) for the same product distance. This improvement comes from increasing the 
overall product distance, and this is in turn due to spreading the codewords in the 
2-dimensional space rather than packing them on a single-dimensional line as in the 
repetition code. This is the same reason that QPSK is more efficient than BPSK (as 
we have discussed in Section 3.1.3). 

We summarize and generalize the above development to any time diversity code. 



6 As we have seen earlier, the 4-PAM constellation requires 5 times more energy than BPSK for 
the same separation between the constellation points. 
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Summarly 3.1 



Time Diversity Code Design Criterion 



Ideal time-interleaved channel 

y e = h £ Xi + w e , i = l (3.62) 

where hi s are i.i.d. CJ\f{ 0, 1) Rayleigh faded channel gains. 

xi, . . . ,xjf are the codewords of a time diversity code with block length L, 
normalized such that 

1 M 

mlE INI 2 = 1 ' < 3 - 63 > 

2=1 



Union bound on overall probability of error: 

Pe - Jf ZJ P ^ 

Bound on pairwise error probability: 

L 1 

P lx* — > x 7 } < TT 5 — 

1 ~ 1 + SNR \xu — Xje\ 2 /4 

where xu is the component of codeword x,;, and SNR := 1/N 0 . 



(3.64) 



(3.65) 



Let Lij be the number of components on which the codewords x,; and Xj differ. 
Diversity gain of the code is 

min Lj t j . (3.66) 



If Lij = 

and 



where 



L for all % j, then the code achieves the full diversity L of the channel, 

4 l (M — 1) 



Pe < ^y^-SNR- L < 
1 ~ x . . - 






SNR 



-L 



U 



mm. 



i^j Oij 



(3.67) 



L 

8ij . 1 1 Xu Xji | 

i=i 



(3.68) 



is the squared product distance between x,- and x r 



Tse and Viswanath: Fundamentals of Wireless Communication 



81 




25Mhz 



200khz 
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8 users per subchannel 



Figure 3.9: The 25 Mhz band of a GSM system is divided into 200 kHz sub-channels 
which are further divided into time slots for 8 different users. 

The rotation code discussed above is specifically designed to exploit time diversity 
in fading channels. In the AWGN channel, however, rotation of the constellation 
does not affect performance since the i.i.d. Gaussian noise is invariant to rotations. 
On the other hand, codes that are designed for the AWGN channel, such as linear 
block codes or convolutional codes, can be used to extract time diversity in fading 
channels when combined with interleaving. Their performance can be analyzed using 
the general framework above. For example, the diversity gain of a binary linear block 
code where the coded symbols are ideally interleaved is simply the minimum, Hamming 
distance between the codewords or equivalently the minimum weight of a codeword; 
the diversity gain of a binary convolutional code is given by the free distance of the 
code, which is the minimum weight of the coded sequence of the convolutional code. 
The performance analysis of these codes and various decoding techniques is further 
pursued in Exercises 3.11 and 3.12. 

It should also be noted that the above code design criterion is derived assuming 
i.i.d. Rayleigh fading across the symbols. This can be generalized to the case when 
the coded symbols pass through correlated fades of the channel (see Exercise 3.13). 
Generalization to the case when the fading is Rician is also possible and is studied 
in Exercise 3.19. Nevertheless these code design criteria all depend on the specific 
channel statistics assumed. Motivated by information theoretic considerations, we 
take a completely different approach in Chapter ?? where we seek a universal criterion 
which works for all channel statistics. 
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User l’s time slots 



Figure 3.10: How interleaving is done in GSM. 



Example 3.4: Time Diversity in GSM 
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Global System for Mobile (GSM) is a digital cellular standard developed in 
Europe in the 1980’s. GSM is a frequency division duplex (FDD) system and uses 
two 25 MHz bands, one for the uplink (mobiles to base station) and one for the 
downlink (base station to mobiles). The original bands set aside for GSM are the 
890-915 MHz band (uplink) and the 935-960 MHz band (downlink). The bands 
are further divided into 200 kHz sub-channels and each sub-channel is shared by 8 
users in a time division fashion (time-division multiple access (TDMA)). The data 
of each user are sent over time slots of length 577 microseconds (/is) and the time 
slots of the 8 users together form a frame of length 4.615 ms (Figure 3.9). 

Voice is the main application for GSM. Voice is coded by a speech encoder 
into speech frames each of length 20 ms. The bits in each speech frame are 
encoded by a convolutional code of rate 1/2, with the two generator polynomials 
D 4 + D 3 + 1 and D 4 + D 3 + D + 1. The free distance of this code is ????. The 
number of coded bits for each speech frame is 456. To achieve time diversity, 
these coded bits are interleaved across 8 consecutive time slots assigned to that 
specific user: the 0 th , 8 th , . . ., 448 th bits are put into the first time slot, the 1 st , 

9 th , . . ., 449 th bits are put into the second time slot, etc. Since one time slot 
occurs every 4.615 ms for each user, this translates into a delay of roughly 40 ms, 
a delay judged tolerable for voice. The 8 time slots are shared between two 20 ms 
speech frames. The interleaving structure is summarized in Figure 3.10. 

The maximum possible time diversity gain is 8, but the actual gain that can 
be obtained depends on how fast the channel varies, and that depends primarily 
on the mobile speed. If the mobile speed is v, then the largest possible Doppler 
spread (assuming full scattering in the environment) is D s = 2f c v/c, where f c is 
the carrier frequency and c is the speed of light. (Recall the example in Section 
2.1.4.) The coherence time is roughly T c = 1/(4 D s ) = c/(8f c v ) (c.f. (2.45)). For 
the channel to fade more or less independently across the different time slots for a 
user, the coherence time should be less than 5 ms. For f c = 900 MHz, this 
translates into a mobile speed of at least 30 km/h. 

For a walking speed of say 3 km/h, there may be too little time diversity. In 
this case, GSM can go into a frequency hopping mode, where consecutive frames 
(each composed of the time slots of the 8 users) can hop from one 200 kHz 
sub-channel to another. With a typical delay spread of about 1/zs, the coherence 
bandwidth is 500 kHz (c.f. Table 2.1). The total bandwidth equal to 25 MHz is 
thus much larger than the typical coherence bandwidth of the channel and the 
consecutive frames can be expected to fade independently. This provides the same 
effect as having time diversity. Section 3.4 discusses other ways to exploit 
frequency diversity. 
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3.3 Antenna Diversity 

To exploit time diversity, interleaving and coding over several coherence time periods 
is necessary. When there is a strict delay constraint and/or the coherence time is large, 
this may not be possible. In this case other forms of diversity have to be obtained. 
Antenna diversity, or spatial diversity, can be obtained by placing multiple antennas 
at the transmitter and/or the receiver. If the antennas are placed sufficiently far apart, 
the channel gains between different antenna pairs fade more or less independently, and 
independent signal paths are created. The required antenna separation depends on the 
local scattering environment as well as on the carrier frequency. For a mobile which 
is near the ground with many scatterers around, the channel decorrelates over shorter 
spatial distances, and typical antenna separation of half to one carrier wavelength is 
sufficient. For base stations on high towers, larger antenna separation of several to 10’s 
of wavelengths may be required. (A more careful discussion of these issues is found in 
Chapter ??.) 

We will look at both receive diversity, using multiple receive antennas (single- input, 
multi-output SIMO channels) , and transmit diversity, using multiple transmit anten- 
nas (multi-input, single-output MISO channels). Interesting coding problems arise 
in the latter and have led to recent excitement in space-time codes. Channels with 
multiple transmit and multiple receive antennas (so-called multi-input multi-output 
or MIMO channels) provide even more potential. In addition to providing diversity, 
MIMO channels also provide additional degrees of freedom, for communication. We 
will touch on some of the issues here using a 2 x 2 example; the full study of MIMO 
communication will be the subject of Chapters 7 to 9. 

3.3.1 Receive Diversity 

In a flat fading channel with 1 transmit antenna and L receive antennas (Figure 
3.11(a)), the channel model is as follows: 

ye[m\ = h([m\x[m\ + W([m] f = 1, . . . , L (3.69) 

where the noise up; [m] ~ CAf(0,N o ) and independent across the antennas. We would 
like to detect x[l] based on yi [1], ... , yz/1]. This is exactly the same detection problem 
as in the use of a repetition code over time, with L diversity branches now over space 
instead of over time. If the antennas are spaced sufficiently far apart, then we can 
assume that the gains hg[ 1] are independent Rayleigh, and we get a diversity gain of 
L. 

With receive diversity, there are actually two types of gain as we increase L. This 
can be seen by looking at the expression (3.34) for the error probability of BPSK 
conditioned on the channel gains: 

Q ( v / 2pIpSNR) . 



(3.70) 
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(a) (b) (c) 

Figure 3.11: (a) Receive diversity; (b) transmit diversity; (c) transmit and receive 
diversity. 

We can break up the total received SNR conditioned on the channel gains into a 
product of two terms: 

||h|| 2 SNR = LSNR- }||h|| 2 . (3.71) 

The first term corresponds to a power gain (also called array gain): by having multiple 
receive antennas and coherent combining at the receiver, the effective total received 
signal power increases linearly with L : doubling L yields a 3 dB power gain. 7 The 
second term reflects the diversity gain: by averaging over multiple independent signal 
paths, the probability that the overall gain is small is decreased. The diversity gain L is 
reflected in the SNR exponent in (3.41); the power gain affects the constant before the 
1/SNR l . Note that if the channel gains h([ 1] are fully correlated across all branches, 
then we only get a power gain but no diversity gain as we increase L. On the other 
hand, even when all the he are independent there is a diminishing marginal return as 
L increases: due to the law of large numbers, the second term in (3.71), 

jini 2 = jEwtr. (3-72) 

n e=i 

converges to 1 with increasing L (assuming each of the channel gains is normalized to 
have unit variance). The power gain, on the other hand, suffers from no such limitation: 
a 3 dB gain is obtained for every doubling of the number of antennas. 8 

7 Although mathematically the same situation holds in the time diversity repetition coding case, 
the increase in received SNR there comes from increasing the total transmit energy required to send 
a single bit; it is therefore not appropriate to call that a power gain in that scenario. 

8 This will of course ultimately not hold since the received power cannot be larger than the transmit 
power, but the number of antennas for our model to break down will have to be humongous. 
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3.3.2 Transmit Diversity: Space-Time Codes 

Now consider the case when there are L transmit antennas and 1 receive antenna, the 
MISO channel (Figure 3.11(b)). This is common in the downlink of a cellular system 
since it is often cheaper to have multiple antennas at the base station than to having 
multiple antennas at every handset. It is easy to get a diversity gain of L : simply 
transmit the same symbol over the L different antennas during L symbol times. At 
any one time, only one antenna is turned on and the rest are silent. This is simply 
a repetition code, and, as we have seen in the previous section, repetition codes are 
quite wasteful of degrees of freedom. More generally, any time diversity code of block 
length L can be used on this transmit diversity system: simply use one antenna at 
a time and transmit the coded symbols of the time diversity code successively over 
the different antennas. This provides a coding gain over the repetition code. One can 
also design codes specifically for the transmit diversity system. There have been a lot 
of research activities in this area under the rubric of space-time coding and here we 
discuss the simplest, and yet one of the most elegant, space-time code: the so-called 
Alamouti scheme. This is the transmit diversity scheme proposed in several third- 
generation cellular standards. Alamouti scheme is designed for 2 transmit antennas; 
generalization to more than 2 antennas is possible, to some extent. 



Alamouti Scheme 



With flat fading, the two transmit, single receive channel is written as 



y[m] = hi[m]xi[m] + h 2 [m\x 2 [m\ + w[m], 



(3.73) 



where h % is the channel gain from transmit antenna i. The Alamouti scheme transmits 
two complex symbols Ui and u 2 over two symbol times: at time 1, aq[l] = Mi, x 2 [l] = m 2 ; 
at time 2, X\[2] = — u 2 ,x 2 [2] — u\. If we assume that the channel remains constant 
over the two symbol times and set h\ = hi[l] = hi[2],h 2 = h 2 [l] = h 2 [ 2], then we can 
write in matrix form: 



. y[ l \ y [ 2 ] ] = [ h i h 2 _ 



Ml -u* 2 
u 2 m* 



+ m;[1] w[2\ . 



(3.74) 



We are interested in detecting mi,m 2 , so we rewrite this equation as 



’ si 1 ] 




hi 


h 2 


Ml 


+ 


’ M 1 ] 


. vW . 




h* 


-hi _ 


_ “2 _ 


w[2]* 



(3.75) 



We observe that the columns of the square matrix are orthogonal. Hence, the detection 
problem for Ui,u 2 decomposes into two separate, orthogonal, scalar problems. We 
project y onto each of the two columns to obtain the sufficient statistics 



* = 1 , 2 , 



n = ||h||Mj + Wi, 



(3.76) 
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where h = [hi,h 2 Y and Wi ~ CJ\f(0, N 0 ) and w\,w 2 are independent. Thus, the 
diversity gain is 2 for the detection of each symbol. Compared to the repetition code, 
2 symbols are now transmitted over two symbol times instead of 1 symbol, but with 
half the power in each symbol (assuming that the total transmit power is the same in 
both cases). 

The Alamouti scheme works for any constellation for the symbols Mi, M 2 , but sup- 
pose now they are BPSK symbols, thus conveying a total of 2 bits over 2 symbol times. 
In the repetition scheme, we need to use 4- PAM symbols to achieve the same data rate. 
To achieve the same minimum distance as the BPSK symbols in the Alamouti scheme, 
we need 5 times the energy per symbol. Taking into account the factor of 2 energy 
saving since we are only transmitting one symbol at a time in the repetition scheme, 
we see that the repetition scheme requires a factor of 2.5 (4 dB) more power than the 
Alamouti scheme. Again, the repetition scheme suffers from an inefficient utilization 
of the available degrees of freedom in the channel: over the two symbol times, bits 
are packed into only one dimension of the received signal space, namely along the di- 
rection [/?,i,/r 2 ]h In contrast, the Alamouti scheme spreads the information onto two 
dimensions - along the orthogonal directions [hi, h^Y and [h 2 , — h*YY- 



The Determinant Criterion for Space-time Code Design 

In Section 3.2, we saw that a good code exploiting time diversity should maximize 
the minimum product distance between codewords. Is there an analogous notion for 
space-time codes? To answer this question, let us think of a space-time code as a 
set of complex codewords {Xj}, where each X,; is an L by N matrix. Here, L is the 
number of transmit antennas and N is the block length of the code. For example, in 
the Alamouti scheme, each codeword is of the form 



Mi —Mg 

m 2 u\ 



(3.77) 



with L = 2 and N 
form 



2. In contrast, each codeword in the repetition scheme is of the 



m 0 

0 M 



(3.78) 



More generally, any block length L time diversity code with codewords {xj} translates 
into a block length L transmit diversity code with codeword matrices {Xj}, where 



Xj = cliag {xn, ...,x iL }. 



(3.79) 



For convenience, we normalize the codewords so that the average energy per symbol 
time is 1, hence SNR = 1 /N 0 . Assuming that the channel remains constant for N 
symbol times, we can write 



y t = h*X + w 4 , 



(3.80) 
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where 

" y[ 1] 1 r h* 1 r w[ 1] ' 

y := _ , h := w := _ . (3.81) 

. y[N] \ [h* L \ [ w[N } _ 

To bound the error probability, consider the pairwise error probability of confusing 
Xb with Xa, when X A is transmitted. Conditioned on the fading gains h, we have 
the familiar vector Gaussian detection problem (see Summary 2): here we are deciding 
between the vectors h*X A and h*X# under additive circular symmetric white Gaussian 
noise. A sufficient statistic is 3?{v*y}, where v := h*(X A — X^). The conditional 
pairwise error probability is 



P{X A -X B |h} = Q 




Hence, the pairwise error probability averaged over the channel statistics is 



P{X A ->X B } 




(3.82) 



(3.83) 



The matrix (X A — X^)(X A - X^)* is Hermitian 9 and is thus diagonalizable by a 
unitary transformation, i.e., we can write (X A — X B )(X A — X B )* = UAU*, where U is 
unitary 10 and A = diag {A 9 , . . . , A|}. Here Xe ’s are the singular values of the codeword 
difference matrix X A — X#. Therefore, we can rewrite the pairwise error probability 



as 



P{X A ^X B } 




(3.84) 



where h := U*h. In the Rayleigh fading model, the fading coefficients he are i.i.d. 
CJ\f( 0, 1) and then h has the same distribution as h (c.f. (A. 22) in Appendix A). Thus 
we can bound the average pairwise error probability, as in (3.54), 



P{X A ^X B }<f] 



L 1 + SNRA2/4' 



(3.85) 



If all the Xj are strictly positive for all the codeword differences, then the maximal 
diversity gain of L is achieved. Since the number of positive eigenvalues Xj equals the 

9 A complex square matrix X is Hermitian if X* = X. 

10 A complex square matrix U is unitary if U*U = UU* = I. 
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rank of the codeword difference matrix, this is possible only if N > L. If indeed all 
the Xj; are positive, then, 



P{X A ->X B } < 



4 l 

snr l nti a? 

4 l 

SNR l det[(X A - X B )(X A - X B )*] ’ 



(3.86) 



and a diversity gain of L is achieved. The coding gain is determined by the minimum of 
the determinant det[(X A — X B )(X A — X B )*] over all codeword pairs. This is sometimes 
called the determinant criterion. 

In the special case when the transmit diversity code comes from a time diversity 
code, the space-time code matrices are diagonal (c.f. (3.79)), and Xg = \dg | 2 , the squared 
magnitude of the component difference between the corresponding time diversity code- 
words. The determinant criterion then coincides with the squared product distance 
criterion (3.68) we already derived for time diversity codes. 

We can compare the coding gains obtained by the Alamouti scheme with the rep- 
etition scheme. That is, how much less power does the Alamouti scheme consume to 
achieve the same error probability as the repetition scheme? For the Alamouti scheme 
with BPSK symbols Uj , the minimum determinant is 4. For the repetition scheme with 
4-PAM symbols, the minimum determinant is 16/25. (Verify!) This translates into the 
Alamouti scheme having a coding gain of roughly a factor of 6 over the repetition 
scheme, consistent with the analysis above. 

The Alamouti transmit diversity scheme has a particularly simple receiver struc- 
ture. Essentially, a linear receiver allows us to decouple the two symbols sent over 
the two transmit antennas in two time slots. Effectively, both symbols pass through 
non-interfering parallel channels, both of which afford a diversity of order 2. In Ex- 
ercise 3.17, we derive some properties a code construction must satisfy to mimic this 
behavior for more than 2 transmit antennas. 



3.3.3 MIMO: A 2 x 2 Example 

Degrees of Freedom 

Consider now a MIMO channel with two transmit and two receive antennas (Figure 
3.11(c)). Let hij be the Rayleigh distributed channel gain from transmit antenna j 
to receive antenna i. Suppose both the transmit antennas and the receive antennas 
are spaced sufficiently far apart such that the fading gains Ay’s can be assumed to be 
independent. There are four independently faded signal paths between the transmitter 
and the receiver, suggesting that the maximum diversity gain that can be achieved is 4. 
The same repetition scheme described in the last section can achieve this performance: 
transmit the same symbol over the two antennas in two consecutive symbol times (at 
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each time, nothing is sent over the other antenna). If the transmitted symbol is x, the 
received symbols at the two receive antennas are 

y.i[l] = hux + Wi[l\, i = 1,2 (3.87) 

at time 1, and 

Vi[ 2] = h i2 x + Wi[ 2], i = 1, 2 (3.88) 

at time 2. By performing maximal-ratio combining of the four received symbols, an 
effective channel with gain Yln= i X^=i l^y| 2 is created, yielding a 4-folcl diversity gain. 

However, just as in the case of the 2x1 channel, the repetition scheme utilizes the 
degrees of freedom in the channel poorly; it only transmits one data symbol per two 
symbol times. In this regard, the Alamouti scheme performs better by transmitting two 
data symbols over two symbol times. Exercise 3.21 shows that the Alamouti scheme 
used over the 2x2 channel provides effectively two independent channels, analogous 

to (3.76), but with the gain in each channel equal to Y^l=iY^j=i \^ij\ 2 - Thus, both 

the data symbols see a diversity gain of 4, the same as that offered by the repetition 
scheme. 

But does the Alamouti scheme utilize all the available degrees of freedom in the 
2x2 channel? How many degrees of freedom does the 2x2 channel have anyway? 

In Section 2.2.3 we have defined the degrees of freedom of a channel as the dimension 
of the received signal space. In a channel with two transmit and a single receive 
antenna, this is equal to one for every symbol time. The repetition scheme utilizes 
only half a degree of freedom per symbol time, while the Alamouti scheme utilizes all 
of it. 

With L receive, but a single transmit antenna, the received signal lies in an L- 
dimensional vector space, but it does not span the full space. To see this explicitly, 
consider the channel model from (3.69) (suppressing the symbol time index m): 

y = ha: + w, (3.89) 

where y := [yi, . . . , yz]* , h = [hi, ... , h l] and w = [w\, . . . , icj. The signal of interest, 
ha;, lies in a one-dimensional space. 11 Thus, we conclude that the degrees of freedom 
of a multiple receive, single transmit antenna channel is still 1 per symbol time. 

But in a 2 x 2 channel, there are potentially two degrees of freedom per symbol 
time. To see this, we can write the channel as 

y = hia;i + h 2 a ;2 + w, (3.90) 

where Xj and hj are the transmitted symbol and the vector of channel gains from 
transmit antenna j respectively, and y = [ 2 / 1 , 2 / 2 ]* and w = [wi,w 2 ] t are the vectors of 
received signals and CA/"(0, Nq) noise respectively. As long as hi and h 2 are linearly 



n This is why the scalar (h*/||h||)y is a sufficient statistic to detect x (c.f. (3.33)). 





Figure 3.12: (a) In the 1x2 channel, the signal space is 1-dimensional, spanned by h. 
(b) In the 2x2 channel, the signal space is 2-dimensional, spanned by hi and I 12 



independent, the signal space dimension is 2: the signal from transmit antenna j arrives 
in its own direction hj and with two receive antennas, the receiver can distinguish 
between the two signals. Compared to a 2 by 1 channel, there is an additional degree 
of freedom coming from space. Figure 3.12 summarizes the situation. 



Spatial Multiplexing 

Now we see that neither the repetition scheme nor the Alamonti scheme utilize all 
the degrees of freedom in a 2 x 2 channel. A very simple scheme that does is the 
following: transmit independent uncoded symbols over the different antennas as well 
as over the different symbol times. This is an example of a spatial multiplexing scheme: 
independent data streams are multiplexed in space. (It is also called V-BLAST in the 
literature.) To analyze the performance of this scheme, we extend the derivation of 
the pairwise error probability bound (3.85) from a single receive antenna to multiple 
receive antennas. Exercise 3.20 shows that with n r receive antennas, the corresponding 
bound on the probability of confusing codeword X B with codeword is 



Xb} - [Jl + SNRA’/t 



where A/s are the singular values of the codeword difference X^ — X B . This bound 
holds for space-time codes of general block lengths. Our specific scheme does not 
code across time and is thus “space-only”. The block length is 1, the codewords are 
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2-dimensional vectors X!,x 2 and the bound simplifies to 



P {xi -> x 2 } < 

< 



[l + SNR ||xi — x 2 1| 2 / 4 
16 

SNR 2 | j X! — x 2 1| 4 



(3.92) 



The exponent of the SNR factor is the diversity gain: the spatial multiplexing scheme 
achieves a diversity gain of 2. Since there is no coding across the transmit antennas, it 
is clear that no transmit diversity can be exploited; thus the diversity comes entirely 
from the dual receive antennas. The factor ||x! — x 2 || 4 plays a role analogous to the 
determinant det [(X^ — Xb)(Xa — X#)*] in determining the coding gain (c.f. (3.86)). 

Compared to the Alamouti scheme, we see that V-BLAST has a smaller diversity 
gain (2 compared to 4). On the other hand, the full use of the spatial degrees of freedom 
should allow a more efficient packing of bits, resulting in a better coding gain. To see 
this concretely, suppose we use BPSK symbols in the spatial multiplexing scheme to 
deliver 2 bits/s/Hz. Assuming that the average transmit energy per symbol time is 
normalized to be 1 as before, we can use (3.92) to explicitly calculate a bound on the 
worst-case pairwise error probability: 



max P (xj — ■» Xj} < 4 • SNR 2 . (3.93) 

*7 4 ? 



On the other hand, the corresponding bound for the Alamouti scheme using 4-PAM 
symbols to deliver the same 2 bits/s/Hz can be calculated from (3.86) to be 

rnaxP {xj — ► Xj} < 1600 ■ SNR -4 . (3.94) 

*7 V 



We see that indeed the bound for the Alamouti scheme has a much poorer constant 
before the factor that decays with SNR. 

We can draw two lessons from the V-BLAST scheme. First, we see a new role for 
multiple antennas: in addition to diversity, they can also provide additional degrees 
of freedom for communication. This is in a sense a more powerful view of multiple 
antennas, one that will be further explored in Chapter 7. Second, the scheme also 
reveals limitations in our performance analysis framework for space-time codes. In 
the earlier sections, our approach has always been to seek schemes which extract the 
maximum diversity from the channel and then they were compared on the basis of the 
coding gain, which is a function of how efficient the schemes utilize the available degrees 
of freedom. This approach falls short in comparing V-BLAST and the Alamouti scheme 
for the 2x2 channel: V-BLAST has poorer diversity than the Alamouti scheme but 
yet is more efficient in exploiting the spatial degrees of freedom, resulting in a better 
coding gain. A more powerful framework combining the two performance measures 
into a unified metric is needed; this is one of the main subjects of Chapter ??. There 
we will also address the issue of whether it is possible to find a scheme which achieves 
full diversity and the full degrees of freedom of the channel. 
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Low-Complexity Detection: the Decorrelator 

One advantage of the Alamouti scheme is its low-complexity ML receiver: the decoding 
decouples into two orthogonal single-symbol detection problems. ML detection of V- 
BLAST does not enjoy the same advantage: joint detection of the two symbols is 
required. The complexity grows exponentially with the number of antennas. A natural 
question to ask is: what performance can sub-optimal single-symbol detectors achieve? 
We will study MIMO receiver architectures in depth in Chapters 7 and ??, but here we 
will give an example of a simple detector, the decorrelator, and analyze its performance 
in the 2x2 channel. 

To motivate the definition of this detector, let us rewrite the channel (3.90) in 
matrix form: 

y = Hx + w, (3.95) 

where H = [h!,h 2 ] is the channel matrix. The input x := [aq,^]* is composed of two 
independent symbols aq,o; 2 To decouple the detection of the two symbols, one idea is 
to invert the effect of the channel: 



y = H 1 y = x + H 1 w = x + w 



(3.96) 



and detect each of the symbols separately. This is in general suboptimal compared 
to joint ML detection, since the noise samples vb\ and w 2 are correlated, flow much 
performance do we lose? 

Let us focus on the detection of the symbol aq from transmit antenna 1. By direct 
computation, the variance of the noise uq is 

IM 2 + 1^-21 

\hnli22 — h,2\h 

Hence, we can rewrite the first component of the vector equation in (3.96) as 



Wn 



12 



(3.97) 



m = xi + 



\/ 1 h'22 | 2 + | /All 2 
\hnh22 — h 21/112! 



Zl, 



(3.98) 



where z\ ~ N 0 ), the scaled version of uq, is independent of aq. Equivalently, 

the scaled output can be written as 



y'i 



h\\h,22 ~ h-2ih\2 _ 
h'22 1 2 T | h’21 1 2 y 

(02hi)oq + zi, 



(3.99) 



where 



r hi 1 


rh- ■- 1 


1 


CM 

1 


’ V\hi 2 \* + \ha \ 2 


1 

1 

sr- 

* 

1 



i = 1,2. 



(3.100) 
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Figure 3.13: Demodulation of X\: the received vector y is projected onto the direction 
02 orthogonal to h 2 - The effective channel for x\ is in deep fade whenever the projection 
of hi onto 02 is small. 



Geometrically, one can interpret hj as the “direction” of the signal from transmit 
antenna j and 0j as the direction orthogonal to hj. Equation (3.99) says that when 
demodulating the symbol from antenna 1, channel inversion eliminates the interference 
from transmit antenna 2 by projecting the received signal y in the direction orthogonal 
to h 2 (Figure 3.13). The signal part is ^hijoq. The scalar gain 0^! is circular 
symmetric Gaussian, being the projection of a two-dimensional i.i.d. circular symmetric 
Gaussian random vector (hi) onto an independent unit vector (02) (c.f. (A. 22) in 
Appendix A). The scalar channel (3.99) is therefore Rayleigh faded like a lxl channel 
and has only unit diversity. Note that if there were no interference from antenna 2, 
the diversity gain would have been 2: the norm || hi || 2 of the entire vector hi has to be 
small for poor reception of X\. However, here, the component of hi perpendicular to h 2 
being small already wrecks havoc; this is the price paid for nulling out the interference 
from antenna 2. In contrast, the ML detector, by jointly detecting the two symbols, 
retains the diversity gain of 2. 

We have discussed V-BLAST in the context of a point-to-point link with two trans- 
mit antennas. But since there is no coding across the antennas, we can equally think 
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of the two transmit antennas as two distinct users each with a single antenna. In the 
multiuser context, the receiver described above is sometimes called the interference 
nuller, zero-forcing receiver or the decorrelator. It nulls out the effect of the other 
user (interferer) while demodulating the symbol of one user. Using this receiver, we 
see that dual receive antennas can perform one of two functions in a wireless system: 
they can either provide a 2-fold diversity gain in a point-to-point link when there is 
no interference, or they can be used to null out the effect of an interfering user but 
provide no diversity gain more than 1. But they cannot do both. This is however not 
an intrinsic limitation of the channel but rather a limitation of the decorrelator; by 
performing joint ML detection instead, the two users can in fact be simultaneously 
supported with a 2-fold diversity gain each. 



Summarjy 3.2 2x2 MIMO Schemes 

The performance of the various schemes for the 2x2 channel is summarized 
below. 





Diversity gain 


Degrees of freedom utilized 
per symbol time 


Repetition 


4 


1/2 


Alamouti 


4 


1 


V-BLAST (ML) 


2 


2 


V-BLAST (nulling) 


1 


2 


channel itself 


4 


2 



3.4 Frequency Diversity 

3.4.1 Basic Concept 

So far we have focused on narrowband flat fading channels. These channels are modeled 
by a single-tap filter, as most of the multipaths arrive during one symbol time. In 
wideband channels, however, the transmitted signal arrives over multiple symbol times 
and the multipaths can be resolved at the receiver. The frequency response is no longer 
flat, i.e. , the transmission bandwidth W is greater than the coherence bandwidth W c 
of the channel. This provides another form of diversity: frequency. 

We begin with the discrete-time baseband model of the wireless channel in Sec- 
tion 2.2. Recalling (2.35) and (2.39), the sampled output y[m ] can be written as 

y[m\ = E he[m] x[m — £\ + w[m\. 

i 



(3.101) 
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Here h([m] denotes the 0 th channel filter tap at time m. To understand the concept of 
frequency diversity in the simplest setting, consider first the one-shot communication 
situation when one symbol x[0] is sent at time 0, and no symbols are transmitted after 
that. The receiver observes 

y[i\ = h £ [£\x[ 0\+w[£\, £ = 0,1,2,... (3.102) 

If we assume that the channel response has a finite number of taps L, then the delayed 
replicas of the signal are providing L branches of diversity in detecting x [0] , since the 
tap gains h([f\ are assumed to be independent. This diversity is achieved by the ability 
of resolving the multipaths at the receiver due to the wideband nature of the channel, 
and is thus called frequency diversity. 

A simple communication scheme can be built on the above idea by sending an 
information symbol every L symbol times. The maximal diversity gain of L can be 
achieved, but the problem with this scheme is that it is very wasteful of degrees of 
freedom: only one symbol can be transmitted every delay spread. This scheme can 
actually be thought of as analogous to the repetition codes used for both time and 
spatial diversity, where one information symbol is repeated L times. In this setting, 
once one tries to transmit symbols more frequently, inter-symbol interference (ISI) 
occurs: the delayed replicas of previous symbols interfere with the current symbol. 
The problem is then how to deal with the ISI while at the same time exploiting the 
inherent frequency diversity in the channel. Broadly speaking, there are three common 
approaches: 

• single-carrier systems with equalization: By using linear and non-linear 

processing at the receiver, ISI can be mitigated to some extent. Optimal ML 
detection of the transmitted symbols can be implemented using the Viterbi al- 
gorithm. However, the complexity of the Viterbi algorithm grows exponentially 
with the number of taps, and it is typically used only when the number of signif- 
icant taps is small. Alternatively, linear equalizers attempt to detect the current 
symbol while linearly suppressing the interference from the other symbols, and 
they have lower complexity. 

• direct sequence spread spectrum: In this method, information symbols are 
modulated by a pseudonoise sequence and transmitted over a bandwidth W 
much larger than the data rate. Because the symbol rate is very low, ISI is 
small, simplifying the receiver structure significantly. Although this leads to 
an inefficient utilization of the total degrees of freedom in the system from the 
perspective of one user, this scheme allows multiple users to share the total 
degrees of freedom, with users appearing as pseudonoise to each other. 

• multi-carrier systems: Here, transmit precocling is performed to convert the 
ISI channel into a set of non- interfering, orthogonal sub-carriers, each experi- 
encing narrowband flat fading. Diversity can be obtained by coding across the 
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symbols in different sub-carriers. This method is also called Discrete Multi-Tone 
(DMT) or Orthogonal Frequency Division Multiplexing (OFDM). Frequency-hop 
spread spectrum can be viewed as a special case where one carrier is used at a 
time. 

For example, GSM is a single-carrier system, IS-95 CDMA and IEEE 802.11b (a 
wireless LAN standard) are based on direct sequence spread spectrum, and IEEE 
802.11a is a multi-carrier system, 

Below we study these three approaches in turn. An important conceptual point 
is that, while frequency diversity is something intrinsic in a wideband channel, the 
presence of ISI is not, as it depends on the modulation technique used. For example, 
under OFDM, there is no ISI, but sub-carriers that are separated apart by more than 
the coherence bandwidth fade more or less independently and hence frequency diversity 
is still present. 

Narrowband systems typically operate in a relatively high SNR regime. In contrast, 
the energy is spread across many degrees of freedom in many wideband systems, and the 
impact of the channel uncertainty on the ability of the receiver to extract the inherent 
diversity in frequency-selective channels becomes more pronounced. This point will be 
discussed in Section 3.5, but in the present section, we assume that the receiver has a 
perfect estimate of the channel. 

3.4.2 Single-Carrier with ISI Equalization 

Single carrier with ISI equalization is the classic approach to communication over 
frequency-selective channels, and has been used in wireless as well as wireline applica- 
tions such as voiceband modems. Much work has been done in this area but here we 
focus on the diversity aspects. 

Starting at time 1, a sequence of uncoded independent symbols x[l], a: [2], . . . is 
transmitted over the frequency-selective channel (3.101). Assuming that the channel 
taps do not vary over these N symbol times, the received symbol at time m is: 

L—l 

y[m ] = E hex[m — £] + w[m\, (3.103) 

£=0 

where x[m] = 0 for m < 1. For simplicity, we assume here that the taps hp are i.i.d. 
Rayleigh with equal variance 1/L, but the discussion below holds more generally (see 
Exercise 3.26). 

We want to detect each of the transmitted symbols from the received signal. The 
process of extracting the symbols from the received signal is called equalization. In 
contrast to the simple scheme in the previous section where a symbol is sent every L 
symbol times, here a symbol is sent every symbol time and hence there is significant 
ISI. Can we still get the maximum diversity gain of L, even though there is no coding 
across the transmitted symbols? 
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Frequency-Selective Channel Viewed as a MISO Channel 



To analyze this problem, it is insightful to transform the frequency-selective channel 
into a flat fading MISO channel with L transmit antennas and a single receive antenna 
and channel gains ho, ■ ■ ■ , Hl-i- Consider the following transmission scheme on the 
MISO channel: at time 1, the symbol x[l] is transmitted on antenna 1 and the other 
antennas are silent. At time 2, x[l] is transmitted at antenna 2, x[2] is transmitted on 
antenna 1 and the other antennas remain silent. At time m, x[m — (] is transmitted 
on antenna £+ 1, for l — 0, . . . , L — 1. See Figure 3.14. The received symbol at time m 
in this MISO channel is precisely the same as that in the frequency-selective channel 
under consideration. 

Once we transform the frequency-selective channel into a MISO channel, we can 
exploit the machinery developed in Section 3.3.2. First, it is clear that if we want to 
achieve full diversity on a symbol, say x [V] , we need to observe the received symbols 
up to time N + L — 1. Over these symbol times, we can write the system in matrix 
form (as in (3.80)): 

y l = h*X + w* (3.104) 

where y f := [y[l \, . . . , y[N + L — 1]], h* := [h 0 ,...,h L _ J,w‘ := [w[0\, . . . w[N + L - 1]] 
and the L by N + L — 1 space-time code matrix 



" z[l] 


*[2] 




■ a; [iV] 


■ x[N + L — 1] ' 


0 


*[!] 


*[2] . 


x[N] 


• x[N + L- 2] 


0 


0 


rr[l] x[2] 






0 


0 




x[l] x[2] 


x[N] 



(3.105) 



corresponds to the transmitted sequence x = [a;[l], . . . , x[N + L — 1]]*. 



Error Probability Analysis 

Consider the maximum likelihood detection of the sequence x based on the received 
vector y (MLSD). With MLSD, the pairwise error probability of confusing x^ with 
xb, when x^ is transmitted is, as in (3.85), 



P{xn 



L 



*b} < n 
£= 1 



1 

1 + SNR Af/4’ 



(3.106) 



where A^’s are the eigenvalues of the matrix (X^ — X^)(X J 4 — X#)* and SNR is the total 
received SNR per received symbol (summing over all paths). This error probability 
decays like SNR~ L whenever the difference matrix X 4 — X# is of rank L. 
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Figure 3.14: The MISO scenario equivalent to the frequency-selective channel. 
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By a union bound argument, the probability of detecting the particular symbol 
x[N] incorrectly is bounded by 

P{x A ->x B }, (3.107) 

x b -.xb[N]^x a [N] 



summing over all the transmitted vectors xg which differ with x A in the N th symbol. 12 
To get full diversity, the difference matrix X A — X# must be full rank for every such 
vector xg (c.f. (3.86)). Suppose m* is the symbol time in which the vectors x A and 
Xb first differ. Since they differ at least once within the first N symbol times, m* < N 
and the difference matrix is of the form 



X A -X B = 



0 

0 

0 



0 x A [m*] — x B [m*] 

0 XA[m*] — xslm*] 

0 



0 



0 x a [ rn*] — x B [m*] ■ J 
(3.108) 



By inspection, all the rows in the difference matrix are linearly independent. Thus 
X A - Xb is of full rank (i.e., the rank is equal to L). We can summarize: 



Uncoded transmission combined with maximum likelihood sequence detection 
achieves full diversity on symbol x[N] using the observations up to time iV+L — 1, 
i.e., a delay of L — 1 symbol times. 



Compared to the scheme in which a symbol is transmitted every L symbol times, the 
same diversity gain of L is achieved and yet an independent symbol can be transmitted 
every symbol time. This translates into a significant “coding gain” (Exercise 3.27). 

In the analysis here it was convenient to transform the frequency-selective channel 
into a MISO channel. However, we can turn the transformation around: if we transmit 
the space-time code of the form in (3.105) on a MISO channel, then we have converted 
the MISO channel into a frequency-selective channel. This is the delay diversity scheme 
and it was one of the first proposed transmit diversity schemes for the MISO channel. 

Implementing MLSD: the Viterbi Algorithm 

Given the received vector y of length n, MLSD requires solving the optimization prob- 
lem 

maxP {y|x} . (3.109) 

X 

12 Strictly speaking, the MLSD only minimizes the sequence error probability, not the symbol error 
probability. However, this is the standard detector implemented for ISI equalization via the Viterbi 
algorithm, to be discussed next. In any case, the symbol error probability performance of the MLSD 
serves as an upper bound to the optimal symbol error performance. 
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Figure 3.15: A finite state machine when x[m] are ±1 BPSK symbols and L — 2. 
There are a total of 4 states. 
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Figure 3.16: The trellis representation of the channel. 

m — 1 m 




Figure 3.17: The dynamic programming principle. If the first m — 1 segments of the 
shortest path to state s at stage m were not the shortest path to state u* at stage 
m— 1, then one could have found an even shorter path to state s. 
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A brute-force exhaustive search would require a complexity that grows exponentially 
with the block length n. An efficient algorithm needs to exploit the structure of the 
problem and moreover should be recursive in n so that the problem does not have to 
be solved from scratch for every symbol time. The solution is the ubiquitous Viterbi 
algorithm. 

The key observation is that the memory in the frequency-selective channel can be 
captured by a finite state machine, 
vector) 

s [m\ 



At time m, define the state (an L dimensional 



x[m — L + 1] 
x[m — L + 2] 



x\m\ 



(3.110) 



An example of the finite state machine when the x [m] ’s are BPSK symbols is given in 
Figure 3.15. The number of states is M L , where M is the constellation size for each 
symbol x[m\. 

The received symbol y[m] is given by 



y[m\ = IT s [m] + w[m], 



(3.111) 



with h representing the frequency-selective channel, as in (3.104). The MLSD problem 
(3.109) can be rewritten as 

min — log P {y[l], . . . , y[n\ | s[l], . . . , s[n}} , (3.112) 

s[l]„...,s[n] 

subject to the transition constraints on the state sequence (i.e., the second component 
of s [m] is the same as the first component of ,s [rn + 1]). Conditioned on the state 
sequence s [1] , . . . , s[n], the received symbols are independent and the log-likelihood 
ratio breaks into a sum: 



logP {?/[!]: • • • > y[ n \ |s[l],...,s[n]} = log P {y[m] |s[m]}. (3.113) 

m = 1 

The optimization problem in (3.112) can be represented as the problem of hireling 
the shortest path through an n-stage trellis, as shown in Figure 3.16. Each state se- 
quence (s [1] , . . . , s [n] ) is visualized as a path through the trellis, and given the received 
sequence y[l], . . . , y[n], the cost associated with the m th transition is 

c m (s[m]) := — logP{r/[m] | s[m]} . (3.114) 

The solution is given recursively by the optimality principle of dynamic programming. 
Let K«.(s) be the cost of the shortest path to a given state s at stage m. Then Fm(s) 
for all states s can be computed recursively: 
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Vl(s) = Ci(s) 




(3.115) 


V m ( s) = nrin [H m _i(u) + c m (s)] , 

U 


m > 1. 


(3.116) 



Here the minimization is over all possible states u, i.e., we only consider the states 
that the finite state machine can be in at stage m — 1 and, further, can still end up at 
state s at stage m. The correctness of this recursion is based on the following intuitive 
fact: if the shortest path to state s at stage m goes through the state u* at stage m — 1, 
then the part of the path up to stage m — 1 must itself be the shortest path to state 
u*. See Figure 3.17. Thus, to compute the shortest path up to stage m, it suffices 
to augment only the shortest paths up to stage m — 1, and these have already been 
computed. 

Once V m (s) is computed for all states s, the shortest path to stage m is simply the 
minimum of these values over all states s. Thus, the optimization problem (3.112) is 
solved. Moreover, the solution is recursive in n. 

The complexity of the Viterbi algorithm is linear in the number of stages n. Thus, 
the cost is constant per symbol, a vast improvement over brute-force exhaustive search. 
However, its complexity is also proportional to the size of the state space, which is M L , 
where M is the constellation size of each symbol. Thus, while MLSD can be done for 
channels with a small number of taps, it becomes impractical when L becomes large. 

The computational complexity of MLSD leads to an interest in seeking sub-optimal 
equalizers which yield comparable performance. Some candidates are linear equaliz- 
ers (such as the zero-forcing and minimum mean square error (MMSE) equalizers, 
which involve simple linear operations on the received symbols followed by simple hard 
decoders), and their decision-feedback versions (DFE’s), where previously detected 
symbols are removed from the received signal before linear equalization is performed. 
We will discuss these equalizers further in Section ??, where we exploit a correspon- 
dence between the MIMO channel and the frequency-selective channel. In Section ??, 
the diversity performance of these equalizers is analyzed. There, we will see that the 
MMSE-DFE receiver achieves the same diversity performance as MLSD. 

3.4.3 Direct Sequence Spread Spectrum 

A common communication system that employs a wide bandwidth is a direct sequence 
(DS) spread spectrum system. Its basic components are shown in Figure 3.18. Infor- 
mation is encoded and modulated by a pseudonoise (PN) sequence and transmitted 
over a bandwidth W. In contrast to the system we analyzed in the last section where 
an independent symbol is sent at each symbol time, the data rate R bits/s in a spread 
spectrum system is typically much smaller than the transmission bandwidth W Hz. 
The ratio W/R is sometimes called the processing gain of the system. For example, 
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Figure 3.18: Basic elements of a direct-sequence spread spectrum system. 



IS-95 (CDMA) is a direct sequence spread spectrum system. The bandwidth is 1.2288 
MHz and a typical data rate ( voice) is 9.6 kbits/s, so the processing gain is 128. 
Thus, very few bits are transmitted per degree of freedom per user. In spread spec- 
trum jargon, each sample period is called a chip, and another way of describing a 
spread spectrum system is that the chip rate (or the sample rate) is much larger than 
the data rate. 

Because the symbol rate per user is very low in a spread spectrum system, ISI is 
typically negligible and equalization is not required. Instead, as we will discuss next, 
a much simpler receiver called the Rake receiver can be used to extract frequency di- 
versity. In the cellular setting, multiple spread spectrum users would share the large 
bandwidth so that the aggregate bit rate can be high even though the rate of each 
user is low. The large processing gain of a user serves to mitigate the interference from 
other users, which appear as random noise. In addition to providing frequency diver- 
sity against multipath fading and allowing multiple access, spread spectrum systems 
serve other purposes, such as anti-jamming from intentional interferers, and achieving 
message privacy in the presence of other listeners. We will discuss the multiple access 
aspects of spread spectrum systems in Chapter 4. For now, we focus on how DS spread 
spectrum systems can achieve frequency diversity. 

The Rake Receiver 

Suppose we transmit one of two n-chips long pseudonoise sequences orx b ■ Consider 

the problem of binary detection over a wideband multipath channel. In this context, 
a binary symbol is transmitted over n chips. The received signal is given by 

y[m\ = E h([m]x[m — £\ + w[m\. (3.117) 

i 



We assume that hp[m] is nonzero only for £ — 0, . . . L — 1, i.e., the channel has L taps. 
One can think of L/W as the delay spread T d . Also, we assume that hp [m] does not 
vary with m during the transmission of the sequence, i.e., the channel is considered 
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time- invariant. This holds if n <C T C W, where T c is the coherence time of the channel. 
We also assume that there is negligible interference between consecutive symbols, so 
that we can consider the binary detection problem in isolation for each symbol. This 
assumption is valid if n 3> L, which is quite common in a spread spectrum system with 
high processing gain. Otherwise, ISI between consecutive symbols becomes significant, 
and an equalizer would be needed to mitigate the ISI. Note however we assume that 
simultaneously n S> T d W and n <C T C IT, which is possible only if T d <C T c . In a 
typical cellular system, T d is of the order of microseconds and T c of the order of 10’s of 
milliseconds, so this assumption is quite reasonable. (Recall from Chapter 2, Table 2.2 
that a channel satisfying this condition is called an underspread channel.) 

With the above assumptions, the output is just a convolution of the input with the 
LTI channel plus noise 

y[m] = (h* x)[m] + w[m], m — 1, . . . n + L (3.118) 

where he is the 0 th tap of the time- invariant channel filter response, with hi = 0 for f < 0 
and £ > L— 1. Assuming the channel h is known to the receiver, two sufficient statistics, 
r a and r B , can be obtained by projecting the received vector y := [y[l], . . . ,y[n + L]]* 
onto the n+L dimensional vectors and v B , where := [(/i*xa)[1], . . . , (h*XA)[n + 
L)Y and w B := [(h * x B )[l], ...,(h* x B )[n + L]}\ i.e., 

ta := v^y, r B := v* B y. (3.119) 

The computation of r a and r B can be implemented by first matched filtering the 
received signal to and to x B . The outputs of the matched filters are passed through 
a filter matched to the channel response h and then sampled at time n + L. (See 
Figure 3.19). This is called the Rake receiver. What the Rake actually does is take 
inner products of the received signal with shifted versions of the candidate transmitted 
sequences. Each output is then weighted by the channel tap gain of the appropriate 
delay and summed. The signal path associated with a particular delay is sometimes 
called a finger of the Rake receiver. 

As discussed earlier, we are continuing with the assumption that the channel gains 
he s are known at the receiver. In practice, these gains have to be estimated and tracked 
from either a pilot signal or in a decision-directed mode using the previously detected 
symbols. (The channel estimation problem will be discussed in Section 3.5.2.) Also, 
due to hardware limitations, the actual number of fingers used in a Rake receiver may 
be less than the total number of taps L in the range of the delay spread. In this case, 
there is also a tracking mechanism in which the Rake receiver continuously searches 
for the strong paths (taps) to assign the limited number of fingers to. 

Performance Analysis 

Let us now analyze the performance of the Rake receiver. To simplify our notation, 
we specialize to antipodal modulation (i.e., x ,4 = -xg = u); the analysis for other 
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Figure 3.19: The Rake receiver. Here, h is the filter matched to h, i.e., hi = h*_ t . Each 
tap of h represents a finger of the Rake. 

modulation schemes is similar. One key aspect of spread spectrum systems is that the 
transmitted signal (± u) has a pseudonoise characteristic. The defining characteristic 
of a pseudonoise sequence is that its shifted versions are nearly orthogonal to each 
other. More precisely, if we write u = [w[l], . . . , u[n]}, and 

u w := [0, . . . , 0, u[l], . . . , u[n], 0, . . . 0] 4 (3.120) 

as the n + L dimensional version of u shifted by £ chips (hence there are £ zeros 
preceding u and L — £ zeros following u above), the pseudonoise property means that 
for every £ — 0, . . . , L — 1, 

n 

I(u<'>)-(ul''))|«^|u[i]| 2 , l + e. (3.121) 

i = 1 

To simplify the analysis, we assume full orthogonality: (u^)*(u^ ')) = 0if£^f. 

We will now show that the performance of the Rake is the same as that in the 
diversity model with L branches for repetition coding described in Section 3.2. We can 
see this by looking at a set of sufficient statistics for the detection problem different 
from the ones we used earlier. First, we rewrite the channel model in vector form 

L—l 

y = h e x.( e) + w, (3.122) 

e=o 

where w := [iu[l], . . . , w[n + L]]* and = ±u^, the version of the transmitted 
sequence (either u or -u) shifted by £ chips. The received signal (without the noise) 
therefore lies in the span of the L vectors {u^/||u||}£. By the pseudonoise assumption, 
all these vectors are orthogonal to each other. A set of L sufficient statistics {r^}i 
can be obtained by projecting y onto each of these vectors 

= hix + w^\ 



£ = !,... ,L-1, 



(3.123) 
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where x = ±||u||. Further, the orthogonality of u^’s implies that w/^’s are i.i.d. 
CJ\f(0, N 0 ). Comparing with (3.32), this is exactly the same as the L-branch diversity 
model for the case of repetition code interleaved over time. Thus, we see that the Rake 
receiver in this case is nothing more than a maximal ratio combiner of the signals from 
the L diversity branches. The error probability is given by 



Pe = E 



Q 



\ 



2M| 2 X>'l7A'o 



e=i 



(3.124) 



If we assume a Rayleigh fading model such that the tap gains hf are i.i.d. CJ\f(Q, 1 /L), 
i.e., the energy is spread equally among all the L taps (normalizing such that the 
E \h (\ 2 ] = 1), then the error probability can be explicitly computed (as in (3.37)): 



Pe = 





(3.125) 



where 



/i : = 



SNR 
1 + SNR 



(3.126) 



|| u 1 1 2 

and SNR := can be interpreted as the average signal to noise ratio per diversity 
branch. Noting that ||u || 2 is the average total energy received per bit of information, 
we can define S b := ||u|| 2 . Hence, the SNR per branch is 1/L ■ Sb/No- Observe that 
the factor of 1/L accounts for the splitting of energy due to spreading: the larger the 
spread bandwidth W, the larger L is, and the more diversity one gets, but there is less 
energy in each branch. 13 As L — » oo, i IM 2 converges to 1 with probability 1 by 

the law of large numbers, and from (3.124) we see that 



Pe * Q {y/2E b /N^j , 



(3.127) 



i.e., the performance of the AWGN channel with the same Sb/No is asymptotically 
achieved. 

The above analysis assumes an equal amount of energy in each tap. In a typical 
multipath delay profile, there is more energy in the taps with shorter delays. The 
analysis can be extended to the cases when the hf s have unequal variances as well. 
(See Section 14.5.3 in [67]). 

13 This is assuming a very rich scattering environment; leading to many paths, all of equal energy. 
In reality, however, there are just a few paths that are strong enough to matter. 
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3.4.4 Orthogonal Frequency Division Multiplexing 

Both the single-carrier system with ISI equalization and the DS spread spectrum sys- 
tem with Rake reception are based on a time-domain view of the channel. But we 
know that if the channel is linear time-invariant, sinusoids are eigenfunctions and they 
get transformed in a particularly simple way. 1S1 occurs in a single-carrier system 
because the transmitted signals are not sinusoids. This suggests that if the channel 
is under-spread (i.e., the coherence time is much larger than the delay spread) and is 
therefore approximately time- invariant for a sufficiently long time-scale, then transfor- 
mation into the frequency domain can be a fruitful approach to communication over 
frequency-selective channels. This is the basic idea behind OFDM. 

We begin with the discrete-time baseband model 

y[m\ = E hg[m] x[m — £\ + w[m]. (3.128) 

i 

For simplicity, we first assume that for each P, the I th tap is not changing with m and 
hence the channel is linear time invariant. Again assuming a finite number of non-zero 
taps L \= TfiV, we can rewrite the channel model in (3.128) as 

L—l 

y[m] = E hgx[m — £] + w[m\. (3.129) 

e=o 

Sinusoids are eigenfunctions of LTI systems, but they are of infinite duration. If we 
transmit over only a finite duration, say N c symbols, then the sinusoids are no longer 
eigenfunctions. One way to restore the eigenfunction property is by adding a cyclic 
prefix to the symbols. For every block of symbols of length N c , denoted by 

d= [d[0ld[l],...,d[N c -l]] t , 

we create an N c + L — 1 input block as 

x = [d[N c - L + 1], d[N c — L + 2], . . . , d[N c - 1], d[0], d[l ), . . . , d,[N c - 1]]* , (3.130) 

i.e., we add a prefix of length L — l consisting of data symbols rotated cyclically 
(Figure 3.20). With this input to the channel (3.129), consider the output 

L—l 

y[m\ = hgx\rn — £} + w[m], m — 1 . . . N c + L — 1. 
e=o 

The ISI extends over the first L—l symbols and the receiver ignores it by considering 
the output over the time interval m G [L, N c + L — 1], Due to the additional cyclic 
prefix, the output over this time interval (of length N c ) is 

L—l 

y[m\ = hgd [(m — L — £) modulo N c ] + w[m]. 

1=0 



(3.131) 
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Figure 3.20: The cyclic prefix operation. 



See Figure 3.21. 

Denoting the output of length N c by 

y = [y[ L \,---,y[ N c + L-i}}* , 
and the channel by a vector of length N c 

h = [h o ,h 1 ,...,h L - 1 ,0,...,0] t , (3.132) 



(3.131) can be written as 

y = h (g) d + w. (3.133) 

Here we denoted 

w = [w[L ], . . . , w[N c + L — 1]]* , (3.134) 

as a vector of i.i.d. CJ\f(0,N o ) random variables. We also used the notation of ® to 
denote the cyclic convolution in (3.131). Recall that the discrete Fourier transform 
(DFT) of d is defined to be 
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Figure 3.21: Convolution between the channel (h) and the input (x) formed from the 
data symbols (d) by adding a cyclic prefix. The output is obtained by multiplying the 
corresponding values of x and h on the circle, and output at different times are obtained 
by rotating the x - values with respect to the h- values. The current configuration yields 
the output y[L\. 
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Taking the discrete Fourier transform (DFT) of both sides of (3.133) and using the 
identity 

DFT (h <S> d) n = s/Kc DFT (h) n • DFT (d) n , n = 0, . . . , N c - 1, (3.136) 

we can rewrite (3.133) as 

Vn h n d n ev n , n 0, • • • , N c 1. (3.137) 

Here we have denoted w 0 , . . . ,wn c -i as the lV c -point DFT of the noise vector w[l], . . . ,w[N c ]. 
The vector [ho, . . . is defined as the DFT of the L-tap channel h, scaled by 

VN C , 

K = ^2 he exp \ • (3.138) 

The scaling ensures that the n th component h n is equal to the frequency response of 
the channel (see (2.20)) at / = nW/N c . 

We can redo everything in terms of matrices, a viewpoint which will prove par- 
ticularly useful in Chapter 7 when we will draw a connection between the frequency- 
selective channel and the MIMO channel. The circular convolution operation u = h(g>d 
can be viewed as a linear transformation 

u = Cd, (3.139) 

where 

ho 0 0 Hl~ i Hl - 2 • h\ 

q hi h 0 0 • 0 /ir-i • h 2 ^ 

0 -0 Hl~ i hn - 2 • h\ ho 

is a circulant matrix, i.e., the rows are cyclic shifts of each other. On the other hand, 
the DFT of d can be represented as an 7V c -length vector Ud, where U is the unitary 
matrix with its (k, n) th entry equal to 

j2nkn ^j * „ _ () y r - 1 . (3.141) 

This can be viewed as a coordinate change, expressing d in the basis defined by the 
rows of U. Equation (3.136) is equivalent to 

Uu = AUd, (3.142) 

where A is the diagonal matrix with diagonal entries the DFT of h, i.e., 

A nn = h n := (y/VcTJh) , n = 0, . . . , N c - 1. 
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Comparing (3.139) and (3.142), we come to the conclusion that 

C = U -1 AU. (3.143) 

Equation (3.143) is the matrix version of the key DFT property (3.136). In geomet- 
ric terms, this means that the circular convolution operation is diagonalized in the 
coordinate system defined by the rows of U, and the eigenvalues of C are the DFT 
coefficients of the channel h. Equation (3.133) can thus be written as 

y = Cd + w = U -1 AUd + w. (3.144) 

This representation suggests a natural rotation at the input and at the output to 
convert the channel to a set of non- interfering channels with no ISI. In particular, the 
actual data symbols (denoted by the length N c vector d) in the frequency domain are 
rotated through the IDFT (inverse DFT) matrix LD 1 to arrive at the vector d. At the 
receiver, the output vector of length N c (obtained by ignoring the first L symbols) is 
rotated through the DFT matrix U to obtain the vector y. The final output vector y 
and the actual data vector d are related through 

y n = h n d n + w n , n = 0, . . . , N c — 1. (3.145) 

We have denoted w := Uw as the DFT of the random vector w and we see that 
since w is isotropic, w has the same distribution as w, i.e., a vector of i.i.d. CM{ 0, N 0 ) 
random variables (c.f. (A. 26) in Appendix A). 

These operations are illustrated in Figure 3.22, which affords the following inter- 
pretation. The data symbols modulate N c tones or sub- carriers which occupying the 
bandwidth W and arc uniformly separated by The data symbols on the sub-carriers 
are then converted (through the IDFT) to time domain. The procedure of introducing 
the cyclic prefix before transmission allows for the removal of ISI. The receiver ignores 
the part of the output signal containing the cyclic prefix (along with the ISI terms) 
and converts the length- N c symbols back to the frequency domain through a DFT. 
The data symbols on the sub-carriers are maintained to be orthogonal as they prop- 
agate through the channel and hence go through narrowband parallel channels. This 
interpretation justifies the name of OFDM for this communication scheme. Finally, 
we remark that DFT and IDFT can be very efficiently implemented (and denoted as 
FFT and IFFT, respectively) whenever N c is a power of 2. 

OFDM Block Length 

The OFDM scheme converts communication over a multipath channel into communi- 
cation over simpler parallel narrowband channels. However, this simplicity is achieved 
at a cost of underutilizing two resources, resulting in a loss of performance. First, the 
cyclic prefix occupies an amount of time which cannot be used to communicate data. 
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Figure 3.22: The OFDM transmission and reception schemes. 

This loss amounts to a fraction N L +L of the total time. The second loss is in the power 
transmitted. A fraction N L +L of the average power is allocated to the cyclic prefix and 
cannot be used towards communicating data. Thus to minimize the overhead (in both 
time and power) dne to the cyclic prefix we prefer to have N c as large as possible. The 
time-varying nature of the wireless channel, however, constrains the largest value N c 
can reasonably take. 

We started the discussion in this section by considering a simple channel model 
(3.129) that did not vary with time. If the channel is slowly time varying (as discussed 
in Section 2.2.1, this is a reasonable assumption) then the coherence time T c is much 
larger than the delay spread T d (the underspread scenario). For underspread channels, 
the block length of the OFDM communication scheme N c can be chosen larger than the 
multipath length T d W , but still much smaller than the coherence block length T C W . 
Under these conditions, the channel model of linear time invariance approximates a 
slowly time varying channel over the block length N c . 

The constraint on the OFDM block length can also be understood in the fre- 
quency domain. A block length of N c corresponds to an inter-sub-carrier spacing equal 
to W/N c . In a wireless channel, the Doppler spread introduces uncertainty in the 
frequency of the received signal; from Table 2.1 we see that the Doppler spread is 
inversely proportional to the coherence time of the channel: D s = 1/4 T c . For the 
inter-sub-carrier spacing to be much larger than the Doppler spread, the OFDM block 
length N c should be constrained to be much smaller than T C W. This is the same 
constraint as above. 

Apart from an underutilization of time due to the presence of the cyclic prefix, we 
also mentioned the additional power due to the cyclic prefix. OFDM schemes that 
put a zero signal instead of the cyclic prefix have been proposed to reduce this loss. 
However due to the abrupt transition in the signal, such schemes introduce harmonics 
that are difficult to filter in the overall signal. Further, the cyclic prefix can be used 
for timing and frequency acquisition in wireless applications, and this capability would 
be lost if a zero signal replaces the cyclic prefix. 
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Frequency Diversity 

Let us revert to the non-overlapping narrowband channel representation of the ISI chan- 
nel in (3.145). The correlation between the channel frequency coefficients ho , ... , h^^-i 
depends on the coherence bandwidth of the channel. From our discussion in Section 2.3, 
we have learned that the coherence bandwidth is inversely proportional to the multi- 
path spread. In particular, we have from (2.48) that 



W c = 



1 

2T d 



W 

2 V 



where we use our notation for L as denoting the length of the ISI. Since each sub-carrier 
is wide, we expect approximately 



N C W C = N c 

W 2 L 

as the number of neighboring sub-carriers whose channel coefficients are heavily cor- 
related (see Exercise 3.29). One way to exploit the frequency diversity is to consider 
ideal interleaving across the sub-carriers (analogous to the time interleaving done in 
Section 3.2) and consider the model of (3.31) 



y e = h e xe + w £ , f = \,...L. 

The difference is that now l represents the sub-carriers while it is used to denote 
time in (3.31). However, with the ideal frequency interleaving assumption we retain 
the same independent assumption on the channel coefficients. Thus, the discussion of 
Section 3.2 on schemes harnessing diversity is directly applicable here. In particular, 
an L-fold diversity gain (proportional to the number of ISI symbols L) can be obtained. 
Since the communication scheme is over sub-carriers, the form of diversity is due to 
the frequency-selective channel and is termed frequency diversity (as compared to the 
time diversity discussed in Section 3.2 which arises due to the time variations of the 
channel). 



Summarjy 3.3 Communication over Frequency-Selective Channels 

We have studied three approaches to extract frequency diversity in a 
frequency-selective channel (with L taps). We summarize their key attributes and 
compare their implementational complexity. 



1. Single-carrier with ISI equalization: 

Using maximum likelihood sequence detection (MLSD), full diversity of L can be 
achieved for uncoded transmission sent at symbol rate. 
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MLSD can be performed by the Viterbi algorithm. The complexity is constant 
per symbol time but grows exponentially with the number of taps L. 

The complexity is entirely at the receiver. 

2. Direct sequence spread spectrum: 

Information is spread, via a pseudonoise sequence, across a bandwidth much 
larger than the data rate. ISI is typically negligible. 

The signal received along the L nearly orthogonal diversity paths is maximal-ratio 
combined using the Rake receiver. Full diversity is achieved. 

Compared to MLSD, complexity of the Rake receiver is much lower. ISI is 
avoided because of the very low spectral efficiency per user, but the spectrum is 
typically shared between many interfering users. Complexity is thus shifted to the 
problem of interference management. 

3. Orthogonal frequency division multiplexing: 

Information is modulated on non-interfering sub-carriers in the frequency domain. 

The transformation between the time and frequency domains is done by means of 
adding/subtracting a cyclic prefix and IDFT/DFT operations. This incurs an 
overhead in terms of time and power. 

Frequency diversity is attained by coding over independently faded sub-carriers. 
This coding problem is identical to that for time diversity. 

Complexity is shared between the transmitter and the receiver in performing the 
IDFT and DFT operations; the complexity of these operations is insensitive to 
the number of taps, scales moderately with the number of sub-carriers N c and is 
very manageable with current implementation technology. 

Complexity of diversity coding across sub-carriers can be traded off with the 
amount of diversity desired. 
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3.5 Impact of Channel Uncertainty 

In the past few sections we assumed perfect channel knowledge so that coherent com- 
bining can be performed at the receiver. In fast varying channels, it may not be easy 
to estimate accurately the phases and magnitudes of the tap gains before they change. 
In this case, one has to understand the impact of estimation errors on performance. 
In some situations, noncoherent detection, which does not require an estimate of the 
channel, may be the preferred route. In Section 3.1.1, we have already come across a 
simple noncoherent detector for fading channels without diversity. In this section, we 
will extend this to channels with diversity. 

When we compared coherent and noncoherent detection for channels without di- 
versity, the difference was seen to be relatively small (c.f. Figure 3.2). An important 
question is what happens to that difference as the number of diversity paths L in- 
creases. The answer depends on the specific diversity scenario. We first focus on the 
situation where channel uncertainty has the most impact: DS spread spectrum over 
channels with frequency diversity. Once we understand this case, it is easy to extend 
the insights to other scenarios. 

3.5.1 Noncoherent Detection for DS Spread Spectrum 

We considered this scenario in 3.4.3, except now the receiver has no knowledge of the 
channel gains he s. As we saw in Section 3.1.1, no information can be communicated in 
the phase of the transmitted signal (in particular, antipodal signaling cannot be used) 
in conjunction with noncoherent detection. Instead, we consider binary orthogonal 
modulation, 14 i.e. , and x# are orthogonal and ||xa|| = ||xs||. 

Recall that the central pseudonoise property of the transmitted sequences in DS 
spread spectrum is that the shifted versions are nearly orthogonal. For simplicity of 
analysis, we continue with the assumption that shifted versions of the transmitted 
sequence are exactly orthogonal; this holds for both x^ and xg here. We make the 
further assumption that versions of the two sequences with different shifts are also 
orthogonal to each other, i.e., (x^)*(x^) = 0 for i ^ (! (the so-called zero cross- 
correlation property). This approximately holds in many spread spectrum systems. For 
example, in the uplink of IS-95, the transmitted sequence is obtained by multiplying the 
selected codeword of an orthogonal code by a (common) pseudonoise ±1 sequence, so 
that the low cross-correlation property carries over from the auto-correlation property 
of the pseudonoise sequence. 

Proceeding as in the analysis of coherent detection, we start with the channel model 
in vector form (3.122) and observe that the projection of y onto the 2 L orthogonal 

14 Typically M - ary orthogonal modulation is used. For example, the uplink of IS-95 employs non- 
coherent detection of 64-ary orthogonal modulation. 
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vectors {x^/Hx^H, x^/||xs||}£ yields 2 L sufficient statistics 

= h e x i + w%\. £ = 1, . . . , L - 1 

= hex 2 + Wb\ £ - 1 - 1 

where v $' s and s are i.i.d. CJ\f(0, N 0 ), and 

) if x^4 is transmitted 
< (3.146) 

) if xb is transmitted 

This is essentially a generalization of the noncoherent detection problem in Section 3.1.1 
from 1 branch to L branches. Just as in the 1 branch case, a square-law type detector 
is the optimal noncoherent detector: decide in favor of x^ if 

DhT>Ei4T, < 3 - 147 > 

£=o e=o 




otherwise decide in favor of x^. The performance can be analyzed as in the 1 branch 
case: the error probability has the same form as in (3.125), but with /i given by 



1 / L ■ £ b /N 0 
2 + 1 /L-S b /No’ 



(3.148) 



where £5 := Hx^ || 2 . (See Exercise 3.32.) As a basis of comparison, the performance 
of coherent detection of binary orthogonal modulation can be analyzed as for the 
antipodal case; it is again given by (3.125) but with fi given by (see Exercise 3.34): 






1/L ■ £ b /N 0 
2 + 1/L ■ Si,/Nq 



(3.149) 



It is interesting to compare the performance of coherent and noncoherent detection 
as a function of the number of diversity branches. This is shown in Figures 3.23 and 
3.24. For L = 1, the gap between the performance of both schemes is small, but they 
are bad anyway, as there is a lack of diversity. This point was already made in Section 
3.1. As L increases, the performance of coherent combining improves monotonically 
and approaches the performance of an AWGN channel. In contrast, the performance 
of noncoherent detection first improves with L but then degrades as L is increased 
further. The initial improvement comes from a diversity gain. There is however a 
law of diminishing return on the diversity gain. At the same time, when L becomes 
too large, the SNR per branch becomes very poor and noncoherent combining cannot 
effectively exploit the available diversity. This leads to an ultimate degradation in 
performance. In fact, it can be shown that as L — » 00 , the error probability approaches 
1 / 2 . 
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# of taps L 



Figure 3.23: Comparison of error probability under coherent detection (solid) and 
noncoherent detection (dotted), as a function of the number of taps L. Here £b/N 0 = 
lOdB. 




Figure 3.24: Comparison of error probability under coherent detection (solid) and 
noncoherent detection (dotted), as a function of the number of taps L. Here Sb/Nq = 



15dB. 
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3.5.2 Channel Estimation 

The significant performance difference between coherent and noncoherent combining 
when the number of branches is large suggests the importance of channel knowledge 
in wideband systems. We have assumed perfect channel knowledge when we analyzed 
the performance of the coherent Rake receiver, but in practice, the channel taps have 
to be estimated and tracked, ft is therefore important to understand the impact of 
channel measurement errors on the performance of the coherent combiner. We now 
turn to the issue of channel estimation. 

In data detection, the transmitted sequence is one of several possible sequences 
(representing the data symbol). In channel estimation, the transmitted sequence is 
assumed to be known at the receiver. In a pilot-based scheme, a known sequence 
(called a pilot, sounding tone, or training sequence) is transmitted and this is used to 
estimate the channel. 15 In a decision- feedback scheme, the previously detected symbols 
are used instead to update the channel estimates. If we assume that the detection is 
error free, then the development below applies to both pilot-based and decision-directed 
schemes. 

Focus on one symbol duration, and suppose the transmitted sequence is a known 
pseudonoise sequence u. We return to the channel model in vector form (c.f. (3.122)) 



L—l 

y — he + w, (3.150) 

t=o 

We see that since the shifted versions of u are orthogonal to each other and the taps 
are assumed to be independent of each other, projecting y onto u^/ll u ^ll will yield 
a sufficient statistic to estimate hn (see Summary 3) 

:= (u^)*y = /i^||u^|| + -u/ £) = VShi + w^\ (3.151) 

where S : = ||u^|| 2 . This is implemented by filtering the received signal by a filter 
matched to u and sampling at the appropriate chip time. This operation is the same 
as the first stage of the Rake receiver, and the channel estimator can in fact be combined 
with the Rake receiver if done in a decision-directed mode. (See Figure 3.19.) 

Typically, channel estimation is obtained by averaging over several such measure- 
ments. For simplicity, we assume that the channel is constant over K symbol times, 
and we obtain K such measurements over these symbol times: 

:=VSh e + w^ 7 k = 1, . . . K. (3.152) 

15 The downlink of IS-95 uses a pilot, which is assigned its own pseudonoise sequence and transmitted 
superimposed on the data. 
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Assuming that he ~ CA/"(0, 1 /L), the minimum mean-square estimate of h e given these 
measurements is (c.f. (A. 84) in Summary 3) 




The mean-square error associated with this estimate is (c.f. (A. 85) in Summary 3) 



1 

z 



1 



1 + 



K£ ’ 
LN 0 



(3.154) 



the same for all branches. 

The key parameter affecting the estimation error is 



SNR est := — . (3.155) 

Ll \ 0 

When SNRgst 1, the mean-square estimation error is much smaller than the variance 
of he (equal to 1 / L) and the impact of the channel estimation error on the performance 
of the coherent Rake receiver is not significant; perfect channel knowledge is a reason- 
able assumption in this regime. On the other hand, when SNR est <C 1, the mean-square 
error is close to 1/L, the variance of he- In this regime, we hardly have any informa- 
tion about the channel gains and the performance of the coherent combiner cannot 
be expected to be better than the noncoherent combiner, which we know has poor 
performance whenever L is large. 

How should we interpret the parameter SNR est ? Since the channel is constant over 
the coherence time T c , we can interpret K£ as the total received energy over the 
channel coherence time T c . We can rewrite SNR est as 



SNR est = —A (3.156) 

Ll \ 0 

where P is the received power of the signal from which channel measurements are 
obtained. Hence, SNR es t can be interpreted as the signal-to-noise ratio available to 
estimate the channel per coherence time per tap. Thus, channel uncertainty has a 
significant impact on the performance of the Rake receiver whenever this quantity is 
significantly below 0 dB. 

If the measurements are done in a decision-feedback mode, P is the received power 
of the data stream itself. If the measurements are done from a pilot, then P is the 
received power of the pilot. On the downlink of a CDMA system, it is more economical 
to have a pilot common to all users; moreover, the power allocated to the pilot can 
be larger than the power of the signals for the individual users. This results in a 
larger SNR est , thus makes coherent combining easier. On the uplink, however, it is not 
possible to have a common pilot, and the channel estimation will have to be done with 
a weaker pilot allotted to the individual user, or even noncoherent ly. With a lower 
received power from the individual users, SNR est can be considerably smaller. 
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3 . 5.3 Other Diversity Scenarios 

There are two reasons why wideband DS spread spectrum systems are significantly 
impacted by channel uncertainty: 

• the amount of energy per resolvable path decreases inversely with increasing num- 
ber of paths, making their gains harder to estimate when there are many paths; 

• the number of diversity paths depends both on the bandwidth and the delay spread 
and given these parameters, the designer has no control over this number. 

What about in other diversity scenarios? 

In antenna diversity with L receive antennas, the received energy per antenna is the 
same regardless of the number of antennas, so the channel measurement problem is the 
same as with a single receive antenna and does not become harder. In antenna diversity 
with L transmit antennas, the received energy per diversity path does decrease with the 
number of antennas used, but certainly we can restrict the number L to be the number 
of different channels that can be reliably learnt by the receiver. A similar scenario 
is true in time diversity where we can choose the depth of interleaving, effectively 
controlling the number of diversity paths L over which we jointly code. 

How about in OFDM systems with frequency diversity? Here, the designer has 
control over how many sub-carriers to spread the signal energy over. Thus while the 
number of available diversity branches L may increase with the bandwidth, the signal 
energy can be restricted to a fixed number of sub-carriers II < L over any one OFDM 
time block. Such communication can be restricted to concentrated time-frequency 
blocks and Fig 3.25 visualizes one such scheme (for L' = 2), where the choice of the 
L' sub-carriers is different for different OFDM blocks and is hopped over the entire 
bandwidth. Since the energy in each OFDM block is concentrated within a fixed 
number of sub-carriers at any one time, coherent reception is possible. On the other 
hand, the maximum diversity gain of L can still be achieved by coding across the 
sub-carriers within one OFDM block as well as across different blocks. 

One possible drawback is that since the total power is only concentrated within 
a subset of sub-carriers, the total degrees of freedom available in the system are not 
utilized. This is certainly the case in the context of point-to-point communication; in 
a system with other users sharing the same bandwidth, however, the other degrees of 
freedom can be utilized by the other users and need not go wasted. In fact, one key 
advantage of OFDM over DS spread spectrum is the ability to maintain orthogonality 
across multiple users in a multiple access scenario. We will return to this point in 
Chapter 4. 
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Time 



Figure 3.25: An illustration of a scheme that uses only a fixed part of the bandwidth 
at every time. Here, one small square denotes a single sub-carrier within one OFDM 
block. The time-axis indexes the different OFDM blocks; the frequency-axis indexes 
the different sub-carriers. 
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Chapter 3: The Main Plot 



Baseline 



We first looked at detection on a narrowband flat fading Rayleigh channel. Under 
both coherent and noncoherent detection, the error probability behaves like 

p e « SNR -1 (3.157) 

at high SNR. In contrast, the error probability decreases exponentially with the 
SNR in the AWGN channel. The typical error event for the fading channel is due 
to the channel being in a deep fade rather than the Gaussian noise being large. 

Diversity 



Diversity was presented as an effective approach to improve performance 
drastically by providing redundancy across independently faded branches. 

Three modes of diversity were considered: 

• time - the interleaving of coded symbols over different coherence time periods; 

• space - the use of multiple transmit and/or receive antennas; 

• frequency - the use of a bandwidth greater than the coherence bandwidth of the 
channel. 



In all cases, a simple scheme that repeats the information symbol across the 
multiple branches achieves full diversity. With L i.i.d. Rayleigh branches of 
diversity, the error probability behaves like 

p e ttc-Sm~ L (3.158) 



at high SNR. 

Examples of repetition schemes: 

• repeating the same symbol over different coherence periods; 

• repeating the same symbol over different transmit antennas one at a time; 

• repeating the same symbol across OFDM sub-carriers in different coherence bands; 
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• transmitting a symbol once every delay spread in a frequency-selective channel so 
that multiple delayed replicas of the symbol are received without interference. 



Code Design and Degrees of Freedom 



More sophisticated schemes cannot achieve higher diversity gain but can provide a 
coding gain by improving the constant c in (3.158). This is achieved by utilizing 
the available degrees of freedom better than in the repetition schemes. 

Examples: 

• rotation and permutation codes for time diversity and for frequency diversity in 
OFDM; 

• Alamouti scheme for transmit diversity; 

• uncoded transmission at symbol rate in a frequency-selective channel with ISI 
equalization. 



Criteria to design schemes with good coding gain were derived for the different 
scenarios by using the union bound (based on pairwise error probabilities) on the 
actual error probability: 

• product distance between codewords for time diversity; 

• determinant criterion for space-time codes. 



Channel Uncertainty 



The impact of channel uncertainty is significant in scenarios where there are many 
diversity branches but only a small fraction of signal energy is received along each 
branch. Direct sequence spread spectrum is a prime example. 

The gap between coherent and noncoherent schemes is very significant in this 
regime. Noncoherent schemes do not work well as they cannot combine the signals 
along each branch effectively. 

Accurate channel estimation is crucial. Given the amount of transmit power 
devoted to channel estimation, the efficacy on detection performance depends on 
the key parameter SNR est , the received SNR per coherence time per diversity 
branch. If SNR est 0 dB, then detection performance is near coherent. If 
SNR est <C 0 dB, then effective combining is impossible. 
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Impact of channel uncertainty can be ameliorated in some schemes where the 
transmit energy can be focused on smaller number of diversity branches. 
Effectively SNR est is increased. OFDM is an example. 
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3.6 Bibliographical Notes 

Reliable communication over fading channels have been studied since the 1970s. Im- 
proving the peformance via diversity is also an old topic. Standard digital communi- 
cation texts such as Chapter 14 of Proakis [67] contains many formulas for the perfor- 
mance of coherent and non-coherent diversity combiners, which we use liberally here. 
The product distance criterion for improving the coding gain while still achieving full 
diversity is relatively new. This was studied in [?] and the rotation example is taken 
from [?, ?]. Transmit antenna diversity has been studied extensively in the late 1990s 
and the determinant code design criterion was introduced by [76]. The delay diver- 
sity scheme was introduced by Seshadri [?]. The Alamo uti scheme was introduced 
by Alamouti [3] and generalized to orthogonal designs in [78]. The diversity analysis 
of the decorrelator was performed in [100] in the context of a space-division multiple 
access system with multiple receive antennas. 

The topic of equalization has received huge amount of attention. The analysis of 
MLSE is adopted from [?]. 

The OFDM approach to communicate over a wideband channel was proposed in 
[?]. Circular convolution and the DFT are classical undergraduate material in digital 
signal processing (Chapter 8, and Section 8.7.5 in particular of [62]). 

The spread spectrum approach to harness frequency diversity has been well sum- 
marized (along with a system view) by Viterbi [99]. The Rake receiver was designed 
by [66]. The impact of channel uncertainty on the performance was studied by various 
authors, including [61, 80]. 
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Exercises 

EXERCISE 3.1. Verify (3.19) and the high SNR approximation (3.21). Hint : Write the 
expression as a double integral and interchange the order of integration. 

Exercise 3.2. In Section 3.1.2 we studied the performance of antipodal signaling 
under coherent detection over a Rayleigh fading channel. In particular, we saw that 
the error probability p e decreases like 1/SNR. In this question, we study a deeper 
characterization of the behavior of p e with increasing SNR. 

1. A precise way of saying that p e decays like 1/SNR with increasing SNR is the 
following: 

lim p P ■ SNR = c, 

SNR-+00 

where c is a constant. Identify the value of c. 

2. Now we want to test how robust the above result is with respect to the fading 
distribution. Let h be the channel gain, and suppose |/?,| 2 has an arbitrary con- 
tinuous pdf / satisfying /( 0) > 0. Does this give enough information to compute 
the high SNR error probability like in the previous part? If so, compute it. If 
not, specify what other information you need. Hint: You may need to inter- 
change limit and integration in your calculations. You can assume this can be 
done without worrying about making your argument rigorous. 

3. Suppose now we have L independent branches of diversity with gains h±, ... ,Ixl, 
and \h(\ 2 having an arbitrary distribution as in the previous part. Is there enough 
information for you to find the high SNR performance of repetition coding and 
coherent combining? If so, compute it. If not, what other information do you 
need? 

4. Using the result in the previous part or otherwise, compute the high SNR perfor- 
mance under Rician fading. How does the parameter n affect the performance? 

EXERCISE 3.3. This exercise shows how the high SNR slope of the probability of error 
(3.19) versus SNR curve can be obtained using a typical error event analysis, without 
the need for directly carrying out the integration. 

Fix e > 0 and define the e- typical error event E e : 

E e := {h : \h\ 2 < l/SNR 1+e }. (3.159) 
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2. Show that 



3. Hence conclude that 



Mm > (1 e). 

SNR^oo log SNR 


(3.161) 


r log Pe , 

hm = — 1. 

SNR— >oo log SNR 


(3.162) 



This says that that the asymptotic slope of the error probability versus SNR plot 
(in dB/dB scale) is —1. 

Exercise 3.4. In Section 3.1.2, we saw that there is a 4 dB energy loss when using 4- 
PAM on only the I-channel rather than using QPSK on both the I and the Q channels, 
although both modulations convey two bits of information. Compute the corresponding 
loss when one wants to transmit k bits of information using 2 fc -PAM rather than 2 k - 
QAM. You can assume k is even. How does the loss depend on kl 

Exercise 3.5. Consider the use of the differential BPSK scheme proposed in Section 
3.1.3 for the Rayleigh flat fading channel. 



1. Find a natural non-coherent scheme to detect u[m based on y[m — 1] and y[m], 
assuming the channel is constant across the two symbol times. Your scheme does 
not have to be the ML detector. 

2. Analyze the performance of your detector at high SNR. You may need to make 
some approximations. How does the high SNR performance of your detector 
compare to that of the coherent detector? 

3. Repeat your analysis for differential QPSK. 



EXERCISE 3.6. In this exercise we further study coherent detection in Rayleigh fading. 

1. Verify Equation (3.37). 

2. Analyze the error probability performance of coherent detection of binary or- 
thogonal signaling with L branches of diversity, under an i.i.d. Rayleigh fading 
assumption (i.e., verify Equation (3.149)). 

Exercise 3.7. In this exercise, we study the performance of the rotated QAM code 
in Section 3.2.2. 

1. Give an explicit expression for the exact pairwise error probability P {x^ -> x^} 
in (3.49). Hint: The techniques from Exercise 3.1 will be useful here. 
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2. This pairwise error probability was upper bounded in (3.54). Show that the 
product of SNR and the difference between the upper bound and the actual 
pairwise error probability goes to zero with increasing SNR. In other words, the 
upper bound in (3.54) is tight up to the leading term in 1/SNR. 

EXERCISE 3.8. In the text, we mainly use real symbols to simplify the notation. In 
practice, complex constellation are used (i.e., symbols are sent along both the I and 
Q components). The simplest complex constellation is QPSK: the constellation is 
{a(l + j),a(l - j),a(-l - j),a(-l + j)j. 

1. Compute the error probability of QPSK detection for a Rayleigh fading channel 
with repetition coding over L branches of diversity. How does the performance 
compare to a scheme which uses only real symbols? 

2. In Section 3.2.2, we developed a diversity scheme based on rotation of real sym- 
bols (thus using only the / channel). One can develop an analogous scheme for 
QPSK complex symbols, using a 2 by 2 complex unitary matrix instead. Find 
an analogous pairwise code-design criterion as in the real case. 

3. Real orthonormal matrices are special cases of complex unitary matrices. Within 
the class of real orthonormal matrices, find the optimal rotation to maximize your 
criterion. 

4. Find the optimal unitary matrix to maximize your criterion. 

Exercise 3.9. In Section 3.2.2, we have used a rotated QPSK constellation to demon- 
strate the possible improvement over repetition coding in a time diversity channel with 
two diversity paths. Continuing with the same model, now consider transmitting at 
a higher rate (than the 4 symbols transmitted by the QPSK scheme) using a QAM 
constellation (with 4n 2 points, for some n > 1). Consider rotating the QAM by a 
rotation matrix of the form in (3.46). Using the performance criterion of the minimum 
squared product distance, construct the optimal rotation matrix. 

Exercise 3.10. In Section 3.2.2, we looked at the example of the rotation code to 
achieve time diversity (with the number of branches, L , equal to 2). Another coding 
scheme is the permutation code. Shown in Figure 3.26 are two 16-QAM constellations. 
Each codeword in the permutation code for L — 2 is obtained by picking a pair of 
points, one from each constellation, which are represented by the same icon. The 
codeword is transmitted over two (complex) symbol times. 

1. Why do you think this is called a permutation code? 

2. What is the data rate of this code? 

3. Compute the diversity gain and the minimum product distance for this code. 
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Figure 3.26: A permutation code. 

4. How does the performance of this code compare to the rotation code of the 
same rate, in terms of the transmit power required? (When we discussed the 
rotation code in Section 3.2, for simplicity we only used the (real) 1 channel, but 
in practice and for fair comparison with the code considered here, one should 
simultaneously use the (imaginary) Q channel as well to double the rate. Since 
the two channels are orthogonal, one can apply the same rotation separately to 
the symbols transmitted in the 2 channels.) 



Exercise 3.11. In the text, we considered the use of rotation codes to obtain time 
diversity. Rotation codes are designed specifically for fading channels. Alternatively, 
one can use standard AWGN codes like binary linear block codes. This question looks 
at the diversity performance of such codes. 

Considered a perfectly interleaved Rayleigh fading channel: 

yg = h e xg + wg, £=1 L 

where hg s and wg s are i.i.d. CJ\f( 0, 1) and CA/”(0, N 0 ) random variables respectively. 
A (L, k) binary linear block code is specified by a k by L generator matrix G whose 
entries are 0’s or l’s. k information bits form a h-dimensional binary- valued vector 
b which is mapped 16 into the binary codeword c = G f b of length L , which is then 
mapped into L BPSK symbols and transmitted over the fading channel. The receiver 
is assumed to have a perfect estimate of the channel gains hg ’s. 

1. Compute a bound on the error probability of ML decoding in terms of the SNR 
and parameters of the code. Hence, compute the diversity gain in terms of code 
parameter (s). 



16 Addition and multiplication are done in the binary field. 
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2 . 



Use your result in (1) to compute the 
generator matrix: 



G = 



1 

0 



diversity gain of the (3, 2) code with 



(3.163) 



How does the performance of this code compare to the rate 1/2 repetition code? 



3. The ML decoding is also called soft decision decoding as it takes the entire re- 
ceived vector y and finds the transmitted codeword closest in Euclidean dis- 
tance to it. Alternatively, a sub-optimal but lower complexity decoder uses hard- 
decision decoding , which for all i first makes a hard decision cp on the £ th trans- 
mitted coded symbol based only on the corresponding received symbol yp, and 
then find the codeword that is closest in Hamming distance to c. Compute the 
diversity gain of this scheme in terms of basic parameters of the code. How does 
it compare to the diversity gain achieved by soft decision decoding? Compute 
the diversity gain of the code in (2) under hard decision decoding. 



4. Suppose now yon still do hard decision decoding except that yon are allowed to 
also declare an “erasure” on some of the transmitted symbols (i.e. you can refuse 
to make a hard decision on some of the symbols.) Can you design a scheme that 
yields a better diversity gain than the scheme in part (3)? Can you do as well 
as soft decision decoding? Justify your answers. Try your scheme out on the 
example in part (2). Hint : the trick is to figure out when to declare an erasure. 
You may want to start thinking of the problems in terms of the example in part 
(2). The typical error event view in Exercise 3.3 may also be useful here. 



Exercise 3.12. 



Exercise 3.13. In our study of diversity models (c.f. (3.31)), we have modelled the 
L branches to have independent fading coefficients. Here we explore the impact of 
correlation between the L diversity branches. In the time diversity scenario, consider 
the correlated model: /i[l], . . . , h[L\ are jointly complex Gaussian with zero mean and 
covariance K/ ( , (CA? (0, K/J in our notation). 

1. Redo the diversity calculations for repetition coding (Section 3.2.1) for this cor- 
related channel model by calculating the rate of decay of error probability with 
SNR. What is the dependence of the asymptotic (in SNR) behavior of the typical 
error event on the correlation K/ ( ? You can answer this by characterizing the 
rate of decay of (3.42) at high SNR (as a function of K^). 

2. We arrived at the product distance code design criterion to harvest coding gain 
along with time diversity in Section 3.2.2. What is the analogous criterion for 
correlated channels? Hint: Jointly complex Gaussian random vectors are related 
to i.i.d. complex Gaussian vectors via a linear transformation that depends on 
the covariance matrix. 
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3. For transmit diversity with independent fading across the transmit antennas, 
we have arrived at the generalized product distance code design criterion in 
Section 3.3.2. Calculate the code design criterion for the correlated fading channel 
here (the channel h in (3.80) is now CAT (0, KQ). 

EXERCISE 3.14. The optimal coherent receiver for repetition coding with L branches of 
diversity is a maximal ratio combiner. For implementation reasons, a simpler receiver 
one often builds is a selection combiner. It does detection based on the received signal 
along the branch with the strongest gain only, and ignores the rest. For the i.i.d. 
Rayleigh fading model, analyze the high SNR performance of this scheme. How much 
of the inherent diversity gain can this scheme get? Quantify the performance loss from 
optimal combining. Hint: You may find the techniques developed in Exercise 3.2 useful 
for this problem. 

Exercise 3.15. It is suggested that full diversity gain can be achieved over a Rayleigh 
faded MISO channel by simply transmitting the same symbol at each of the transmit 
antennas simultaneously. Is it correct? 

Exercise 3.16. ALxl MISO channel can be converted into a time-diversity channel 
with L diversity branches by simply transmitting over one antenna at a time. 

1. In this way, any code designed for a time-diversity channel with L diversity 
branches can be used for a MISO (multi-input single-output) channel with L 
transmit antennas. If the code achieves /e-fold diversity in the time-diversity 
channel, how much diversity can it obtain in the MISO channel? What is the 
relationship between the minimum product distance metric of the code when 
viewed as a time-diversity code and its minimum determinant metric when viewed 
as a transmit-diversity code? 

2. Using this transformation, the rotation code can be used as a transmit diversity 
scheme. Compare the performance of this code and the Alamouti scheme in a 
2x1 Rayleigh fading channel, using BPSK symbols. Which one is better? How 
about using QPSK symbols? 

3. Use the permutation code (c.f. Figure 3.26) from Exercise 3.10 on a 2 x 1 Rayleigh 
fading channel and compare its performance with the Alamouti scheme using 
QPSK symbols (so the rate is the same in both the schemes). 

Exercise 3.17. In this exercise, we derive some properties a code construction must 
satisfy to mimic the Alamouti scheme behavior for more than 2 transmit antennas. 
Consider communication over n time slots on the L transmit antenna channel (c.f. 
(3.80)): 



= h*X + w‘. 



(3.164) 




